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Preface 


Quantum mechanics is no longer a revolutionary theory. In the more 
than 35 years since its inception it has become an established branch of 
physics. 

Students of quantum mechanics are saved trouble if they are not led 
through all the historical pitfalls, and instead acquainted from the very 
beginning with concepts, such as spin, that cannot be grasped except by 
quantum mechanical means. 

The first aim of this work has thus been to break away from the 
traditional presentation of the subject matter, by casting aside as much 
as possible reliance on arguments based on the correspondence with 
classical physics. 

With this in mind, certain terminological usages, such as calling a 
-function a ‘wave function” or referring to the particle occupation 
number representation as “second quantization,” have been eliminated 
deliberately, because teaching experience has shown that such terms are 
the source of misunderstanding or of misleading thinking habits that are 
difficult to eradicate once they are established. 

The second aim has been to emphasize topics of contemporary interest 
to physicists engaged in experimental research. 

There are excellent texts, for example the classics by Schiff or by 
Landau and Lifshitz, which treat standard problems, such as the energy 
levels of the hydrogenic electron or the eigenstates of the angular 
momentum operators, and no need was felt to compete with these 
treatments. Accordingly, standard problems have been included only if 
they were needed for illustration of basic concepts. 

On the other hand, topics such as time reversal invariance, or super- 
selection rules, or the interaction picture, are in most texts treated as 
highbrow and couched in group theoretical or field mechanical language 
inaccessible to many experimental physicists, although in fact these 
topics are elementary and should be treated as such. 

Tn this sense the present work is an attempt to present advanced 
quantum mechanics from an elementary point of view. 
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This need not disqualify it to serve as an introductory text. The content 
of this work has actually been given, in the author’s opinion successfully, 
together with assigned problems and numerous references as an intro- 
ductory two-year course for graduate students at The University of 
British Columbia. 

Habits die hard, however, and the author realizes that in fact this work 
may find its most common use as a supplementary text to the more 
standard treatments. 


Vancouver, Canada F, A. KAamMpFFEeR 
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PREFACE, containing a statement of aims, precedes this 


TaBLE oF ConTEnTs, designed as a coherent abstract in an attempt to 
provide the prospective reader with the orientation in which 
he will have to traverse the subject matter. This is followed 
by a 


Prelude, in which are set forth briefly those aspects of physical reality 
that cannot be accommodated within classical mechanics, 
leading to the motivation for describing in 


Section | Pure States in terms of state vectors allowing a probability 
interpretation which constitutes one of the basic postulates 
of quantum mechanics. Using the dichotomic spin variable as 
example, one is led naturally to introduce in 


Section 2 Observables as operators in the abstract space spanned by 
the state vectors, illustrated by construction of the Pauli 
matrices representing the three cartesian components of spin. 
In 


Section3 Transformations in State Vector Space That Leave the Phy- 
sical Content of Quantum Mechanics Invariant are found, 
namely unitary and anti-unitary transformations, which are 
recognized as suitable for representing symmetry operations, 
with the operation of rotation in spin space singled out as 
preliminary example. As an alternative to the state vector 
description in 


Section 4 The Density Matrix is considered, and constructed for the 
case of spin 1/2 in terms of the polarization vector. It is 
stressed that the density matrix can grasp states which do not 
possess a state vector description, and the statistical ensemble 
is briefly discussed as a model for such a situation. The con- 
cepts developed up to this point should enable one to enter in 


*Section 5 The Theory of Selective Measurements, which contains 
the full formal development of quantum mechanics based 


*Sections preceded by an asterisk may be omitted on first reading. 
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solely on analysis of experimental situations realized by 
selective measurements, with compound spin orientation 
experiments serving as illustration. One is led in this context 
to consider in 


“Section 6 The Representation of Nonselective Measurements, 


requiring introduction of the concept of randomly distributed 
phases of measurement symbols. Once again idealized spin 
orientation measurements are used for illustration. 


Section 7 The Fundamental Dynamical Postulate governing the evo- 


lution of an object in time is stated in terms of the temporal 
development of expectation values, and from this formula- 
tion are derived state picture (or Schroedinger picture), 
operator picture (or Heisenberg picture), and interaction 
picture, together with the unitary transformations that inter- 
connect them. The resulting dynamical equations are stated 
in both state vector and density matrix language, and solved 
by various methods for simple examples, the spin magnetic 
resonance problem being treated in detail. To enlarge the 
applicability of the concepts developed thus far, in 


Section 8 The Representation of Observables with Nondenumerably 


and Denumerably Infinite Ranges of Possible Values is sketched 
and the concept of the ;/-function of a state introduced, with 
emphasis on its geometrical interpretation as transformation 
function in Hilbert space. With this preparation one can now 


grasp in 


Section 9 Displacements of the Observer, leading in particular to an 


explicit representation of the Galileo transformation. By 
exploiting the correspondence of the resulting transformation 
formulae with the classical formulae, the momentum of an 
object is recognized as generator of the displacement opera- 
tion, enabling one to derive the equation governing the 
y-function of an object in coordinate representation. The 
commutation relations between coordinate and momentum 
operators are found and used to introduce in 


*Section 10 Uncertainties and the Relations between Them. 


Heisenberg’s relation is derived, the concept of optimum 
state explained, and expressions for such states are cal- 
culated. The results of Section 9 enable one further to indulge 
by making in 
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"Section || A Digression on Superfluidity, in which the concept of 7 
quasi particles is introduced phenomenologically, with an eye 
on a more rigorous derivation from particle theory given later 
in Section 30. The main thread of argument is then picked up 
again by considering in 


Section 12 Rotations of the Observer, leading, in analogy to the 77 
displacement operation, to recognition of the component 
operators of angular momentum as generators of rotations 
around corresponding axes. The problem of spin magnetic 
resonance is taken up once more and solved again, this time 
by transforming the dynamical equation to and from suitably 
chosen rotating coordinate frames, The fundamental theorem 
governing eigenstates of angular momentum is stated and its 
proof relegated to Appendix 1. By exhibiting in 


Section 13 The Connection between Invariance Properties of the 83 
Hamiltonian and Conservation Laws the ground is prepared 
for explaining in 


Section 14 The Invariance under Inversion of Coordinates and the 87 
Law of Conservation of Parity. The expectation value of any 
observable odd under inversion is shown to vanish in states of 
definite parity, and some physical consequences of this 
theorem are examined. The evidence gained from experiments 
involving the B-decay of nuclei is discussed in terms of non- 
conservation of parity, and the necessity of considering 
“combined inversion’’ as a symmetry operation is pointed 
out. By contrast, as shown in 


Section I5 Invariance under Reversal of Motion does not lead to a 99 

conserved quantum number analogous to parity, because of 
the anti-unitary nature of this so-called time reversal trans- 
formation. States can, however, be labeled by a dichotomic 
quantum number according to their behavior under re- 
peated time reversal. The twofold degeneracy of all states 
that change sign under repeated time reversal is established, 
and the concept of the superselection rule is introduced. 
Eigenstates of angular momentum are used as illustration for 
both Sections 14 and 15. 


Section 16 The Particle Concept in Quantum Mechanics is explained 5 
as an abstraction far removed from the naive particle concept 
of classical physics, enabling one to grasp by introduction of 
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the quasi-particle concept properties of macroscopic objects 
with the same formalism originally designed for description 
of so-called elementary particles. The division into fermions 
and bosons is defined. The exclusion principle is stated and 
used to describe in 


Section 17 Fermion States in terms of a dichotomic occupation 
number of quantum states, leading naturally to introduction 
of creation and annihilation operators and their representa- 
tion. Neutrino and antineutrino states are used as simplest 
possible examples, and the possibility of understanding 
conservation of lepton numbers as superselection rules is 
pointed out. By an analogous development in 


Section 18 Boson States are introduced and illustrated by a deriva- 
tion of the equations governing transverse photon states. 
Particular attention is given to the description of photon 
polarization, and by a formal generalization one is led to 
consider scalar and longitudinal photons. With this prepara- 
tion in 

Section 19 Electrons and Positrons are treated as simplest examples 
of fermions which do not have firm parallel or antiparallel 
alignment of spin with momentum because of their non- 
vanishing mass. The charge conjugation symmetry of the 
theory is exploited to circumvent introduction of negative 
energy states, and after positron states have been recognized 
as spatially inverted electron states the conservation of lepton 
number is once again viewed as a possible superselection rule. 
A procedure reducing four-component -functions to a two- 
component form is explained. 


Section 20 The Lack of Sufficient Reason for Actually Existing Inter- 
actions is exhibited by recording the ambiguities inherent in 
attempts to account for the peculiar form of the electro- 
magnetic interaction by making the phase of #)-functions non- 
integrable. The requirement of invariance under phase trans- 
formations leads one to consider in 


*Section 21 The Idea of the Compensating Field, whose adoption re- 
quires recourse to vector mesons as the only primary agents 
of interaction between fermions. Attempts at removing 
inconsistencies encountered with interactions of short range 
are mentioned. Because of its long range one is tempted to 
view, as is done in 
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“Section 22 Gravitation as a Compensating Field. This is possible, in 
analogy to the gauge theory of electromagnetism, provided 
one takes into account the one basic difference between 
gravitation and electromagnetism, namely that inertial 
frames in presence of gravitation are, in general, accelerated 
with respect to each other if they are some finite distance 
apart. 


Section 23 The Starting Point of Quantum Electrodynamics serves as 
an opportunity to introduce the idea of the Feynman graph, 
and the concept of the scattering operator for which an im- 
plicit expression is found by introduction of the time ordering 
operator. Evaluation of this expression is attempted by 
developing in 


Section 24 Perturbation Theory and the Propagator Concept, 
which has as its central theme a theorem by Wick, and is 
accompanied by further elaboration of the Feynman graph 
concept, and its manipulation in momentum space. 


*Section 25 The Hierarchy of Propagators in presence of interaction 
all but forces one to view the world subspecie aeternitatis as 
an infinite concatenation of propagators, whose analytic 
penetration presents formidable difficulties which have as yet 
not been overcome. In absence of any internally consistent 
dynamical theory of interactions, one can fall back as in 


Section 26 On Selection Rules Due to Symmetry under Inversions 
and Rotations of Coordinates. The system of two photons is 
treated as an example of how to determine the intrinsic 
parity of objects, such as positronium, for which this concept 
can be defined without ambiguity. The two-photon state is 
also particularly suited to illustrate, as is done in 


Section 27 Permutation Symmetry of Multiple Particle States. 
Part of this section is devoted to construction of explicit 
representations for the operator of transposition of particle 
labels in angular momentum representation and in particle 
occupation number representation. 


Section 28 Some Consequences of Symmetry under Particle Con- 
jugation and Time Reversal are derived by exploiting con- 
sistently the anti-unitary nature of the corresponding sym- 
metry operators. The conditions are specified under which one 
can introduce the concept of conjugality and consider it a good 
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quantum number. The principle of reciprocity is proved and 
various conditions are examined under which this leads to 
validity of the principle of detailed balance. 


Section 29 Attributes Characteristic of Objects Engaging in Strong 
Interactions, such as isospin, baryon number, strangeness or 
hypercharge, require re-examination of the symmetry opera- 
tions introduced in Sections 26, 27, 28, and lead to selection 
rules different from the ones encountered in the preceding 
sections. The section closes with an examination of all possible 
attributes conserved through superselection rules generated 
by anti-unitary symmetry operations. 


*Section 30 The Quasi Particle Concept emerges as a powerful tool 
when one wishes to grasp the intrusion of quantum mechanical 
principles into the domain of macroscopic phenomena, such as 
superfluidity and superconductivity, encountered in sub- 
stances at low temperatures. Some aspects of the quantum 
mechanical N-body problem, using the nonperturbative 
methods of Bogoliubov, Cooper, and Beliaev, are reviewed 
with the aim to exhibit the crucial part played by the 
principle of superposition of probability amplitudes in the 
minimization of energy through formation of quasi particles 
in presence of interactions between ordinary particles. 


Appendix | The Eigenstates of Angular Momentum are needed so 
often in the course of this work that their representation in 
both angular momentum space and coordinate space have 
been reviewed and collected in this appendix for ready 
reference. A similar need is accommodated by summarizing in 


Appendix 2 The Addition of Two Angular Momenta, containing 
tables for the Clebsch-Gordan coefficients for addition of an 
angular momentum j,=1/2 and an angular momentum 
jz = 1, respectively, to an arbitrary angular momentum jj. 


Appendix 3 Vector Spherical Harmonics are defined and used for 
classification of photon states, giving rise to additional 
information on the subjects treated in Sections 18 and 27. 


Appendix 4 The Invariance of Dirac’s Equation under Lorentz Trans- 
formations is treated mainly for the purpose of deriving a well- 
known formula, governing the structure constants of this 
transformation, which is needed in Section 22. 
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Appendix 5 The Most General Canonical Transformation of a Pair of 
Fermion Operators is derived explicitly, following a develop- 
ment of Koppe and Miihlschlegel which deserves the widest 
possible attention, with particular reference to Sections 27 
and 30. 


Appendix 6 The Delta Function and Its Application to Phase Space 
Considerations has been included as possible aid to the under- 
standing of Sections 23 and 24. 


Appendix 7 If Galileo Had Known Quantum Mechanics he would 
presumably have understood the strict conservation of mass 
in nonrelativistic mechanics as consequence of a curious 
superselection rule, following from invariance under Galileo 
transformations, which was, in fact, first discovered by 
Bargmann. 
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Prelude 


Quantum mechanics purports to be a description of physical reality 
which deliberately eliminates from theory all features not demanded by 
experiment. That there should be any need at all for special efforts to 
accomplish this obviously sensible aim is partly due to a peculiar feature 
of human language. 

Language has been largely fashioned after macroscopic models. For 
example, if the word “particle” is used as a subject to which various 
physical properties are attributed, one notices the subversive effect of 
language when one tries to completely avoid the surreptitious use of 
some mental image of a “‘particle”’ between measurements. However, to 
refrain from using inappropriate mental models is just the kind of 
intellectual asceticism demanded by quantum mechanics. 

Development of quantum mechanics was forced by the recognition of 
two distinct experimental aspects of physical reality which cannot be 
accommodated within the framework of classical mechanics. 


(1) Beginning with the turn of this century it transpired that some 
dynamical quantities, which in classical mechanics are always accorded 
a continuous range of values, may in fact assume only certain discrete 
values. Any physical quantity A which assumes only discrete values 
@y, .., Gp, -.. is said to be “quantized,” Examples are the energy values 
of an electron bound to an atomic nucleus, and the intrinsic angular 
momentum or “‘spin”’ of an elementary “‘particle.”’ 

Although temporary amends were made by imposition of more or less 
mysterious ‘‘quantum rules” on classical mechanics, these artifices 
became untenable when one finally had to face in atomic physics yet 
another fundamental aspect of reality, namely 

(2) the impossibility of simultaneous exact determination of the 
totality of physical attributes of an object. Observation of atomic 
phenomena and their logical analysis, in particular, led to the discovery 
that 

(a) the interaction between the object of a measurement and the 
measuring apparatus cannot be indefinitely weakened, and 
1 
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(b) the disturbance produced by the interaction object-apparatus 
isonly statistically predictable and cannot, therefore, be compensated. 
Thus measurement of one attribute of an object can produce un- 

controllable changes in the value previously assigned to another attribute 
of the same object. It is therefore meaningless to simultaneously assign 
numerical values to all attributes of an object. 


These facts governing the atomic domain are inconsistent with the 
classical theory of measurement which is based on the belief that the 
interaction object-apparatus, if it cannot be made negligibly small, can 
at least be taken into account precisely and can thus, in principle, be 
compensated. 

Two physical attributes A™ and A® will be called “compatible” if 
measurement of one does not affect the value assigned to the other by a 
preceding measurement. Examples of compatible attributes are: The 
absolute value J and one component, J; say, of the angular momentum 
of an object. The three components of the momentum of an object. The 
energy and any one other conserved quantity in a closed system. 

Two physical attributes A and B which are not compatible are called 
“incompatible.”” Examples of incompatible attributes are: any two 
components of angular momentum of an object; momentum and parity 
of an object, provided the momentum does not have the value zero; 
strangeness and conjugality of an elementary particle, provided the 
strangeness is not zero. 

Every physical system will now be assumed to possess a complete set 
of compatible physical attributes A”...A, so that any two of these 
attributes are compatible and that no other attributes exist which are 
compatible with every member of the set. This assumption is not trivial, 
because at present there exists no experimental criterion which would 
allow one to determine whether a compatible set is complete. For 
example, it was thought for a long time that the complete set of attributes 
of an electron consisted of momentum p, mass m, charge e, spin 8, 83. 
Recently yet another attribute of the electron has emerged, the lepton 
number L, which is compatible with all members of the set p, m, é, 8, 83. 

It should be stressed again that the number of physical attributes in 
one complete set A is, in general, much smaller than the number of all 
possible physical attributes of an object. One can usually find other 
complete sets B, C, ... which are mutually incompatible. In the example 
above 8, can be replaced by a component of spin other than sg, resulting 
in a different complete set. 

A “complete measurement” on an object means a set of observations 
enabling one to ascribe definite values a{ to a maximum number of 
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compatible attributes A, Since any experiment designed to find the 
value b of another attribute B will now affect one or more of the previously 
established values a\ in an uncontrollable way, any complete measure- 
ment realizes the optimum state of knowledge about a given object. 

If the set of compatible attributes A™ of an object is known to have 
the values a”, the object is said to be in a “pure quantum state” or 
simply “state” characterized by the quantum numbers a‘"), One is thus 
led to search for a description of the state of an object, containing all the 
information represented by the set of numbers a‘"), and containing only 
that information. 
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Pure States 


The basic mathematical concepts needed for the description of the 
state of an object are most simply developed for idealized physical 
systems in which any physical attribute A can assume only a finite 
number of discrete values or quantum numbers @, ..., @,. 

In reality many physical attributes are, of course, even in atomic 
physics, capable of assuming a continuous range of values. To grasp such 
continuous sets of quantum numbers requires some mathematical sophis- 
tication, however, which might tend to obscure the basic simplicity of 
the quantum mechanical formalism, and for this reason they will be ex- 
cluded from consideration temporarily, to be taken up later in Section 8. 

A very simple state is the one depending on a dichotomic attribute, i.e. 
an attribute which can have two values only. Examples of dichotomic 
attributes are: the electron spin component in direction of an applied 
magnetic field ; the electric charge that may be carried by a nucleon; the 
parity of a set of pions in the center of mass frame ; the number of fermions 
in a given quantum state. 

As a starting point consider an experimental situation in which a spin 
4 associated with a magnetic moment is known to be aligned in a given 
direction, which may be sufficiently characterized by two polar angles 
#,. Such a spin state can always be prepared by performing a Stern- 
Gerlach type of experiment with the external field in direction 9, 0 and 
letting only the appropriate component of the split beam emerge from 
the apparatus. This state will be represented by a two-dimensional 
complex unit vector or “state vector” denoted 


uy mG) 


where the two complex numbers «, and A are functions of the direction 
#,@ of the spin. The label “a,” shall represent the quantum number 
“spin up in direction #, 9.” The requirement that |a,> be a unit vector, 
meaning that the scalar product of |a;> with its hermitean conjugate, 
denoted 

ae 


(1.2) <a;| = af Bt 
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be unity, namely 


—_— 
(1.3) <a,|ay> = af (3) = |ors|*+|Bs|? = 1 

etl B L 
qualifies the quantities |a,|? and |8,|* for a probability interpretation 
which will be arranged to suit the following experimental fact. 

Tf on an atomic beam with electronic spins } aligned in direction 3, 9 
and represented by the state vector |a;> another Stern-Gerlach experi- 
ment is performed with external field in parallel with the z-axis, this 
observation of the z component of spin interferes with the state |ay> 
such that some spins are aligned parallel to the z-axis and some anti- 
parallel to it. The outcome of this experiment is, in principle, only statis- 
tically predictable. Thus, there are required two numbers, adding up to 
unity, one of which is the probability for finding the spin aligned parallel 
to the z-axis and the other the probability for finding the spin aligned 
antiparallel to the z-axis. This suggests arranging the dependence of a 
and f, on # and ¢ such that |o,|® and |f;|* can be identified as these 
probabilities, respectively. With this convention the states |a,.> = (0) and 
|a_> = () represent situations in which the spin in z direction has with 
certainty the values +1 and —1, respectively. [In the following, com- 
ponents of a state vector |a;> will generally be denoted (a;),. With this 
convention «1 = (ay); and By = (a1)2.] 

In generalization of this probability interpretation it will now be 
assumed that the projection of a state 


(a), (b)s 
ja> = , on another state |b> = | 


represents the amplitude for finding upon measurement the quantum 
number b if the object is known to be in the state |a>, where the projection 
is defined as 


(1.4) <bla> = (b)F (a), + (b)E (aot... = <alb>*. 


Thus, |\<b\a>|® shall be interpreted as the probability for observing the 
quantum number b, if the object is known to be in a state characterized by the 
quantum number a. For example, if |a,> = (0) is the state in which the 
spin is with certainty parallel to the z-axis, and |a,> = (3') is the state in 
which the spin is parallel to the direction #, 9, then 


<ay|ar> = 1.0, +0.81 = m4, 


and |q,| is the probability for finding the spin +1 in z direction if the 
object is in the spin state |a1). 
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This basic postulate of quantum mechanics enables one to infer from 
the dichotomic nature of electronic spin existence of another unit vector 


ae es) 2 2 
1.5 a> = = ; ay |? + =1 
So |a2) (-) (a las|*-+ Ba! 

representing the state of spin antiparallel to the direction #,@, which is 
orthogonal to the vector |a1>, 

(1.6) <az|ay> = 0, 

because if the object is known to be in the spin state |a,), the probability 
for finding the spin in opposite direction is zero. The two possible spin 


states that can be found upon observation of spin in a given direction 
#, p are thus represented by a system of two orthogonal complex vectors, 


(1.7) <a;laq,> = 8, — (t, & = 1,2). 


Since observation of the spin in a given direction #, is a complete 
measurement and realizes the optimum knowledge attainable about 
electronic spin, the system of basis vectors |a,>, |a2> should be complete, 
i.e. satisfy the closure relation 


(1.8) x la><a| = 1, 


the identity symbol I being defined by <6|I|a> = <b|a) for any vectors 
|b>, |@>. Indeed, if one considers measurement of spin in some other 
direction #’, 9’, symbolized by two orthonormal vectors |b,>, |b>, the 
basic interpretation postulate of quantum mechanics requires that these 
vectors can be written as linear superpositions 


(1.9) |b.> = |a1> ere + [42> Cox = |a1> <aa|b,> + [42> <ao|0,>. 
From this follows by projection and after use of (1.7) 
(1.10) <b;|b,> =| x <bj|a,> <a;|b,> 


which is consistent only if relation (1.8) holds. 

The probability interpretation, the orthonormality condition, and the 
closure relation are not affected if any state vector is multiplied by a 
number of modulus unity. Thus, the same physical state |a> is described 
by all vectors e'\a>, where « is any real number. 

The concepts developed here for the special case of the spin } are easily 
adapted to any state depending on a dichotomic variable, Thus the state 
vector 


(1.11) la» = (5): [alt +1el* = 2 
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will describe the charge state of a nucleon, if |«|* is interpreted as the 
probability for finding the nucleon as a proton, |B|* as the probability 
for finding it as a neutron. Similarly, the same symbol can describe the 
occupation state of a fermion quantum state, if |«|* and |8|* are inter- 
preted as the probabilities for finding that state empty or full, respec- 
tively. 

Extension of these notions to states depending on attributes that may 
have more than two values proceeds without difficulty. A simple example 
is the charge state of a pion, which requires for its description a state 
vector having three components 


a 
(1.12) |a> = (): Ja|?-+|B|?+[y|* = 1, 
v4 

because according to experiment a pion may have positive, zero, or 
negative electric charge. In expression (1.12) a, 8, y are taken as the 
probability amplitudes for finding the pion positive, neutral, negative, 
respectively. 

Additional compatible attributes of an object can always be accom- 
modated by extending the dimensionality of the state vector space. 
Formally, this is done by forming the direct product of any two state 


vectors, namely 
[af], (a): 
|a{?> x |ap> = («) x («#) 


fa), (Ph 
(ai), (af Je 


[a Je fa": 
[al J [ale 


(1.13) [af aly 


i] 


where |a{) is the state in which the attribute A™ has the value ai), and 
ja the state in which A® has the value a”. For example, a nucleon 
possesses two dichotomic attributes, spin and charge. The combined 
spin and charge space is spanned by four unit vectors, which in a self- 
explanatory notation may be written |pt>; |p1>; |nt>; |m}> and 
represented by the set of direct products 


1 
11> = (aee“(ahaw = (0): #4 = (6G), = 


ooo 
coors 
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NOTES 


Gerlach and Stern [1] were first to observe the splitting into two 
components of an atomic beam in an inhomogeneous magnetic field. 
This was later interpreted as being caused solely by the magnetic moment 
associated with the spin } of the remaining valence electron. 

Fermi [2] gives a lucid account of the use of vectors depending on a 
dichotomic variable, and uses them for the description of the charge 
state of systems of nucleons in particular. 

Dirac [3] invented the bracket notation used throughout this work. 
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a SECTION 2 


Observables 


The measurement of an attribute A on an object in a state |b) will, in 
general, interfere with the state |b> such that after observation of A the 
object will be found in one of the states |a;> which affix the value a; of A 
to the object. Which of the states |a,>, |a,>, ... will result from the 


measurement, i.e. which of the possible values aj, ds, ... will actually 
be observed in an individual measurement of A, can be predicted 
only statistically. 


Although it is thus not, in general, possible to associate a specific value 
a; of A with a given state |b>, an average value A of A will emerge if the 
observation of A is carried out on an ensemble of objects all in the same 
pure state |b>. 

For example, consider again the spin state |a,> = (#}), describing a 
spin o = 2s(|s| = 4) having with certainty the value +1 in direction @, 9, 
and suppose an observation of the spin component in z direction, o3, 
is carried out. The result of observation, performed by means of a Stern- 
Gerlach experiment with the magnetic field in z direction, will be the 
establishment of either state |a,> = (}) or state |a_> = (9), i.e. og will be 
found to have either the value +1 or the value —1, with probabilities 
|a,|* and |8;|*, respectively. However, if the average value of the spin 
transforms under rotations of the coordinate system as a vector, one 
should expect és to be simply the projection of the unit vector in (9, ¢) 
direction on the z-axis, namely 


(2.1) G3 = cost. 


Similarly, the average values of o, and oy in the spin state |a,> should 
come out to be 


(2.2) és 
(2.3) é = sindsin 9. 


sind cos 9 


The formalism which reflects the interference of observation with 
states and at the same time provides a means for computing the average 
value of an observable A in any given state |b) consists of representing 
an observable by a hermitean linear operator such that the eigenvectors 
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of this operator are the states |a;>, in which A has definite values a;, 
and such that the corresponding eigenvalues are the numbers a; them- 
selves, 


(2.4) Ala> = aa. 


Such a representation is possible because the eigenvalues of a hermitean 
linear operator form an orthogonal system that can be normalized. This 
representation of the observable A as a linear operator satisfying (2.4) 
is also convenient, because A will transform a given state |b>, in general, 
into another state |c> 


(2.5) |c> = Alb) 
such that the projection of |c) on |b) is the average value of A in the state 


[b>, 


(2.6) A = <b|c> = <b| Ald». 
To see this, expand the state |b> in terms of the basis vectors |a;> 
(2.7) |b> = |ay> <ay|b> + |az> az|b>+... 


where <a,|b> is the probability amplitude for finding the value a; of A 
in state |b>. As a consequence of (2.4) one has 


(2.8) A|b> = a|a1> <a,|b> + a2|a2> (ag|b> +... 
and thus 
(2.9) <b|A|b> = a,|<a4|b>|? + a2|<ae|b>|? +... 


which is, by definition, the average value of A, because each value a; 
contributes to A with the corresponding probability |<a;|b>|*. 
Equation (2.5) can be written explicitly in matrix and vector notation 


(c)y ‘Ay, Ayo .«..\ /(b)1 
(2.10) (¢)e = As Avy (b)s : 


The requirement that the average value of a physical observable be 
a real number restricts the representation of observables by operators 
to hermitean operators, i.e. A must satisfy the condition 


(2.11) At=A 
where the hermitean adjoint A* is defined by the equation 
(2.12) <Ate|b> = <c|A|b>. 


Applying the matrix notation (2.10) to this definition, one verifies the 
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operation of hermitean adjunction to be equivalent to transposing the 
matrix A and taking the complex conjugate of its elements, 


(2.13) Aj, = Aj. 
The reality of A for hermitean operators follows from 
(2.14) A = <b|A|b> = (Atd|b> = <b|.At|b>* = <b] A|b>* = A*. 


It is sometimes useful to keep in mind that, although any observable 
must be represented by a hermitean operator, the converse need not be 
true, i.e. a hermitean operator need not necessarily represent an observ- 
able. 

From the foregoing, it follows that the observable spin in direction 
#, @ must be represented by a hermitean operator, which may be written 
as a 2 x 2 matrix 


Ay, A 
(2.15) 0 = ley ea ; A4;, Age real; 


so that |a,> = (f:) and |a,> = (#:) are the two eigenstates of og, with 
eigenvalues +1 and —1, respectively, 

(2.16) 65,.|41> = +|a1)5 9, 0|42> = —|@2> 

or, in components, 

Ayia, +Ay2Bi = 0 Aja +Ais Bo = —a 

Alka; +AsoBy = By Af}as+AssBs = —Bs 


The necessary and sufficient conditions that these equations for «;, 8; 
have solutions are that the coefficients have vanishing determinants, 
namely 


(2.17) 


Ay, Age—|Aj9|?+1—Ay1—Aoe = 0; 


(2.18) 
Ay Age—|Aio|?+1+A11+ Age 


i} 


These equations are equivalent to the restrictions 
(2.19) Ay, +Ae2 = 0; Aj; Ax2—|Ajo|? +1 = 0 


so that o9,,. can be represented in the form 


A A 
2.20) opo=(4e _42)i Ait ldul?= +1; Anreal 


To obtain the dependence of «;, 8; and A,;, Aj» on the polar angles #, ¢, 


14 Concepts in Quantum Mechanics 


consider as a starting point the z component of spin og. The corresponding 
operator must be a special case of (2.20), obtained by putting 3 = 0, and 
will be written 


(2.21) a= (7 Dee Zi, +|Zre|2=1; Zr real. 
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The matrix elements Z,,, Z;, must be chosen such that the two states 


y 0 
(2.22) poe A la.) = (;) 
are the eigenstates of og with eigenvalues +1 and — 1, respectively, 
(2.23) os|a4> = +04; as|a_> = —|a_> 


or, in components, 
(2.24) Zy,=1, Zi = 0; Zy2=0, —-2, = -1. 
Thus o; is found to be represented by the matrix 
1 0 
(2.25) 03 = i _\): 


Tf one demands now in accordance with (2.1) that the average value of 
a in the state |a,> = (#) be cosd, one obtains for «, and f, the condition 


(2.28), = Casoslany = &_BE() _1)(f3) = lnal*—IBul® = cose 


which together with the normalization condition 


(2.27) |oy|?+ Ba]? = 1 
yields the solution 
(2.28) a, = e¥ cos (9/2); By = ext sin (9/2) 


with arbitrary phases ys and y. The phase of 8, has been written (y+) 
because any state vector is determined only up to a common phase of 
all its components in any case, and by convention the phase y will now 
be put equal to zero, so that |a,> becomes the special case of |a,> for 
0 = 0. Thus 


cos (9/2) 
(ek, Jay = (em ain can 


The remaining unknown phase y will now be determined by utilizing 
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conditions (2.2) and (2.3). Denoting the operators representing o, and 
oy by 


Xu =) 
2.30 = 5 
@sy a= (re zt 
aes ie se Xi + [X12]? = 1; Xi, real 
. YR -Yu)’ Yii+|¥is|? = 1; Yui real 


equations (2.2) and (2.3) read explicitly, if the average is taken for the 
state |a,> as given in (2.29), using some elementary trigonometry, 
& = <a,|o,|a,> 
= X,,cosP+4(X2.e%+ XheX) sind = sin#cos o 
& = <a,|o2|41> 
Yj, c0s0 + 4$(Yi.e%+ Vi,eX) sind = sin#sin @. 


(2.31) 


Since sin and cos# are linearly independent these equations require 


(2.32) Xn =¥un=0 
so that according to (2.30) X,. and Yj», must have modulus unity, 
(2.33) Xi = eff; Vy. = ei, 


Substitution into (2.31) gives the equations 
(2.34) Helxt®) + eAt)] = cos @ 
: Felt 4 ext] = sin 9. 


From the first of these equations follows 


(2.35) x+6=9 

which, substituted into the second equation (2.34) gives 
(2.36) Fele-O + elerr-O] = sin 9. 
This can be true only if 

(2.37) n-§ = —2/2. 


Thus one of the phases x, y, € remains undetermined, one chooses by 
convention 


(2.38) é=0 
so that 
(2.39) n = —7/2; x=9 


and therefore 
(2.40) eal = Fae 
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Applying the same line of reasoning to the state |a2>, one thus finally 
obtains the following representations in terms of the polar angles #, 9 


__ (cos (8/2) 
ie eve Bs See 

_ (—sin (8/2) e% 
oe >= (seer) 


ee ee ec 


The so-called Pauli matrices (2.43) are special cases of the general spin 
operator which, with help of (2.42) and (2.43), is found from Eqs. (2.17) 
and (2.20) 


saan 
(2.44) ae se sin} e ). 


sinfe'? —cost 


It is interesting to note that the operator og, and its “cartesian com- 
ponents” oj, a», og satisfy the same relation that holds between an 
ordinary vector pointing in direction #, 9 and its cartesian components, 


(2.45) 5,2 = 01 Sind cos gtogsindsin 9 +03 cos#. 

The Pauli matrices (2.34) have the algebraic properties 

(2.46) o,0,+0,0; = yl; 0102—020, = 2iog (cyclically). 
Introducing s; = $0; one finds the commutation relations (C.R.s) 
(2.47) 818,—8981 = 183 (cyclically). 


Such C.R.s will be recognized in Section 12 as a general property of any 
three operators that represent an angular momentum in quantum 
mechanics. 

The optimum information obtainable about an object is, in general, 
contained in any state vector that is a linear superposition of the 
simultaneous eigenstates |a{" af?) ...> of the set of compatible observables 
A, A®), .,.. The requirement that |a{!a{?)...> form a complete set can 
be met only if the operators representing compatible observables Am... Am 
commute, 


(2.48) AM Am Aim) 4M — 0, 


To see this, suppose |a{a> is a simultaneous eigenstate of A® and 
A with eigenvalues a” and af, respectively, so that 


(2.49) [Am A a A) AD] ja? al?» = (a? a? -aP al?) jaf? a?» =0. 
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Tf now |a{? a{?> are required to form a complete set, then any state |b> 
can be expanded 


(2.50) 1b> = DDewlaPaP; — cy = <a aby 
tk 

so that because of (2.49) 

(2.51) [A A® — A A) by = 0 

for all vectors |b>, which can be true only if 

(2.52) A A®_ 4 4M — 0, 


The general case (2.48) follows from this by induction. 

One may, of course, count two or more compatible observables as a 
single observable which upon observation yields two or more numbers. 
This freedom is reflected in the possibility of representing, in the space 
spanned by the direct products, |a{!> x a> the operator 


AM 4@ — 4 4a 
by the direct product 


(2.53) A= AM x 4® 


which is a diagonal matrix if A® and A are diagonal. This follows by 
straightforward computation from the following definitions. 
The direct product of two matrices A and B 


(2.54) C=AxB 
is meant to have the matrix elements 
(2.55) ‘iam = Ain Brn. 
The labeling of the rows in C is done in the sequence yj =11, 12,..., In, 
21, 22,..., 2n, ..., similarly for the columns. Thus the direct product of 
the matrices 
Ay, A By, B 
2.56 A = ( 11 ) Bee ( 11 _ 
aes Aw Aas Bu Bas 


(2.57) Gre Ay, Boy 
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The following statements are easily verified. 


(I) If A and B are square matrices, then C is also square. 
(II) If AA’ =A” and BB’ = B" and if Ax B=C and A’x B’=C' 
then also CC’ = A" x B” or (A x B)(A’ x B’) = AA’ x BB’. 
(III) The direct product of two diagonal matrices is again a diagonal 
matrix. 
(IV) The direct product of two unit matrices is again a unit matrix. 


NOTES 


Pauli [1] invented the description of dichotomic spin in terms of the 
three operators 04, 2, 03. 

The definition of the direct product used in this work is identical with 
the one given by Wigner [2]. 
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SECTION 3 


Transformations in State Vector Space That Leave the 
Physical Content of Quantum Mechanics Invariant 


Summing up the results of the two preceding sections one may say 
there are two quantities which represent the physical content of quantum 
mechanical states. 


(I) The absolute value of the projection of one state vector |a> on 
another state vector |b>, |¢b|a>|. It is identified with the probability 
amplitude for finding the value b of the observable B if the observable 4 
is known to have the value a. 

(Il) The average value of an observable A ina state |b>, A = <b| Ady. 
The observable A is represented by a linear hermitean operator A+ = A, 
whose eigenvectors |a;>, defined by A|a;> = a,|a;>, form an orthonormal 
basis, <a;|a,> = 8;., © |a;> <a;| = I, spanning the state vector space. 

i 


Obviously, these two experimentally accessible quantities are not 
affected by any transformation 


(3.1) |a> > |a’); |b> + |b; A+A’ 
such that 

(3.2) |<b'|a’>| = |<bla>| 

and 

(3.3) A = A. 


One possibility of effecting such a transformation consists of a change 
in the basis vectors which corresponds, geometrically speaking, to a 
rotation of the coordinate frame in state vector space. This linear 
operation U and its inverse U~! are defined by 
(3.4) 

|a> = Ula’); Ua = |a’); Ua) +f|b>) = aU a> +RU|o) 
with the understanding that (3.3) should hold, and that (3.2) be satisfied 
through the stronger condition 


(3.5) <b'|a’> = <bla) 
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which requires U to be a unitary operator, 

(3.6) UUt=UtU =I or Ga ha U; 

Condition (3.3) entails that A’ be obtained from A by the similarity 
transformation 

(3.7) A’ = UtAU. 


The transformed operator A’ retains the hermitean property of A. 
Another possibility of effecting a transformation (3.1) which satisfies 
(3.2) consists of performing the nonlinear operation @ defined by 


(3.8) O(a\a>+f|b>) = «* Olay +B* O|b> 
such that 
(3.9) <@a|Ob> = <alb>* = <bla>. 


Operators of this kind are called antiunitary operators. 

The importance of this invariance property of quantum mechanical 
description of reality resides in the possibility of representing symmetry 
operations in physical space by suitably chosen unitary or antiunitary 
operations in the abstract space spanned by the state vectors. The 
operation of inversion of spatial coordinates, for example, can be repre- 
sented by a unitary operator, whereas the operation of reversal of 
motion, formally equivalent to a reversal of the time axis, requires 
representation by an antiunitary operator, as will be shown later in 
Sections 14 and 15. 

Of immediate usefulness is the possibility, opened by the invariance 
under unitary transformations, of choosing as basis a set of vectors such 
that the matrices representing observables appear in the simplest 
possible form. In particular, it is always possible to diagonalize a hermi- 
tean matrix A by a unitary transformation. The diagonal elements of 
the thus transformed A are then the eigenvalues of A, and the columns 
of the transformation matrix U are made up out of the untransformed 
eigenvectors of the matrix A. 

To prove this, write A|a;> = a;|a;> in components 


(3.10) X Ann(@an = 4(4)m- 
Having then introduced the transformation U by 
(3.11) ja.) = Ulaiy; Uta,» = lai 
one deduces immediately from 

(3.12) 


a,ja;> = a,U+|a,> = Ut Alay = Ut AUU*|a,> = A’ ai 
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that the eigenvalues of A’ are the same as those of A, namely a,, the 
eigenfunctions being now |a;> = U+|a,>. By taking 


(3.13) ym = (dn)n ite. 
(41)1 (a)... (ay) (a:)F ... 
U= (41)2 (a2)2 ; Ut= (42)¥ (a2)z 
the eigenvectors reduce to the simple form 


<a, |a,> 3) 
(3.14) |ai> = Uta) = <as|a;> = 8; 


il 0 
ie, |a,> = (\); |a;> = (' ete. 


The demonstration that A’ is diagonalized by (3.13), with a; as diagonal 
elements, is now straightforward. One has 


(3.15) (AU) mi = XZ Amn Uni = D Amn(Gin = i(2i)m 


and therefore, because of the orthogonality of eigenvectors belonging to 
different eigenvalues, 


(3.16) (UF AU) = Y (G)A (Aim = 4: 8;p 


ay 0 


It is often useful to write a unitary operator in the form 

(3.17) U =e 

where the generator of the transformation, S, is a hermitean operator, 
(3.18) St+=8 

and the symbol eS stands for the expansion 

(3.19) eS = 1+i8 + (12/21) 82+... 
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As an example, consider the unitary matrix whose columns are the 
eigenvectors (2.41), (2.42) of the general spin matrix (2.44), 


cos (9/2) —sin ()/2) e~*? 
sin (9/2) e'? cos (8/2) ) 


cos (0/2) sin (9/2) e~"? 
—sin (0/2)e"? cos (8/2) ): 


By straightforward computation one verifies unitarity 


(3.20) U= ( 


ur = ( 


(3.21) UU+ = UtU =1 = (; 1) 
ot 
and the ensuing diagonalization of the general spin operator (2.44) 
1 0 
t= Utell = 
(3.22) o! = Ute = ft i 


with diagonal elements +1, —1, as expected. 

From the interpretation of #,9 as polar angles in physical space, it 
follows that the unitary operator (3.20) represents in abstract spin space 
a rotation of coordinates in physical space such that the z'-axis points 
in a direction described by the angles #, ¢ in the untransformed coordi- 
nate system. 

The representation of spatial rotations in spin space is seen to be 
double valued, i.e. rotation through an angle } = 27 does not regenerate 
the original states |a,> and |a2>, one finds —|a,> and —|a2) instead, and 
it requires a rotation through # = 47 to recover |a,> and |a2>. This does 
not lead to any inconsistencies with observation, however, because |a,> 
and —|a,> represent the same physical state. 

It will be shown in Section 12 that the rotation by angle ¢ around an 
axis with direction cosines «, B, y is represented by 


(3.23) U = exp id(as; +Ps, +83) where 8; = 0;/2. 
This may be written, using the expansion (3.19) and the C.R.s of the ;, 
(3.24) U = cos (p/2) + 2i(as; + B32 + 8g) sin (p/2). 
It must be kept in mind that 
exp [ip (ws; + B82+783)] # exp (idass) exp (ifBs2) exp (ipyss). 


NOTES 


Unitary operators have been widely employed ever since their signi- 
ficance in quantum mechanics was completely elucidated by Dirac [1] 
and Jordan [2]. 
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Antiunitary operators, on the other hand, although their importance 
for quantum mechanics was pointed out a long time ago by Wigner [3], 
did not acquire equal status with unitary operators until recently, when 
time-reversal symmetry has come to the forefront of experimental 
research in particle physics, 
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The Density Matrix 


An alternative way of apprehending the state of an object, free from 
the arbitrariness of phase which afflicts the representation of states by 
state vectors, consists of describing the state by a hermitean matrix 
M(b), whose elements are in terms of the components (6); of the corre- 
sponding state vector |b> 


(4.1) M,,(b) = (b); (b)f or = M = |b><|. 
The normalization condition ¢6|b> = 1 is then equivalent to the condition 
(4.2) trace M = 1 


the trace being the sum of the diagonal elements of M. Computation of 
the average value A of an observable A in the state characterized by 
M(b) is then accomplished by evaluation of 


(4.3) A = trace(MA) = trace(AM). 
Indeed, in the state |b>, 
(4.4) 
A= z x (6) Amn(O)n 
= DY Mam(b)Anm = X [M(b) Alun = trace (MA). 


From the representation (4.1), it follows that this so-called density 
matrix representing a pure state |b> is idempotent, 


(4.5) M’=M 
because one has, in components, for orthonormal state vectors 
(4.6) 


(M*)y, = o My Mp = x (b),(b)}*(b) (OYE = (6) ()E = My. 


A hermitean matrix can obviously satisfy (4.5) only if all its eigen- 
values satisfy the same condition, i.e. its eigenvalues must be either 0 
or +1. From (4.2), it follows then that only one eigenvalue is + 1 and all 
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others are 0, because M can be brought on diagonal form by a unitary 
transformation, which leaves the trace invariant, 


trace (U+ MU) = trace(MUU*) = trace M. 


Geometrically speaking, M(b) is thus a projection operator, which 
applied to an arbitrary vector |a> projects it in direction of the eigen- 
vector |b> belonging to the eigenvalue +1 of M. 

Since |b,> is the eigenvector of M(b,) with eigenvalue +1, whereas all 
other vectors |b,> with s 4 r belong to the eigenvalue 0, 


(4.7) M(b,)|b»> = [3 Ca¥ Oda] = Bulbs 
the expansion 

(4.8) |a> = z |b.> <b,|a> 

yields immediately the result 

(4.9) M@)Ia> = [b,><b,la). 


The projection operator M(b,) can thus quite literally be taken as a 
measurement symbol representing a selective measurement that accepts 
only those objects which possess value 6, of the attribute B and rejects 
all others. 

The average value of the density matrix M(b) in a state |a> is now 
equal to the probability for finding the value 6 of B if A is known to 
have the value a, 


(4.10) MW) = <a|M(b)|a> = <alb> <bla> = |<ald>|2. 


Since the reasoning of equation (4.4) is applicable to the hermitean 
operator M(b) in the state |a>, which in turn can be represented by a 
density matrix M(a), one has the interesting result 


(4.11) |<a|b>|? = trace [M(a) M(b)). 


The entire physical content of the quantum mechanics of pure states has 
thus been reduced to statements involving density matrices only. 


The state 
_ (#1) _ (cos (0/2) 
lo> = (st) = (an aia 
describing a spin which has with certainty the value +1 in direction 
0, 9 will again serve as an example. The corresponding density matrix is 
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(4.12) 
ae |a;|? oy BH\ _ (cos® (#/2) sin (9/2) cos (9/2) e"? 
= (-, ot at) “i (ca (8/2) cos (9/2) e"? —sin® (/2) ) 


which obviously satisfies trace M =1 and is idempotent, M?=M. A 
particularly instructive representation for M is obtained if one utilizes 
the possibility of writing any 2 x 2 matrix as superposition of the four 
linearly independent matrices 


yee ie 5 iy/iee0 he bie 
"iy a=(; 0) emer ye ay fe 


If M is required to be hermitean and have trace unity, it must be of the 
form 


(4.13) M = }(1+Ps) 


where P = (P,,P2,P3) is a vector whose three real components charac- 
terize the state. The idempotence of M can be satisfied only if P is subject 
to the condition 


(4.14) Pe =1 


which leaves for the description of the state two parameters, as expected. 
Indeed, from (4.13) and (4.14) follows, utilizing (2.46), 


M? = }(1+Po)(I+Po) 
(4,15) = F[1+2Po+ > Pioi+ DY P; Py (oj; 0, +0, 0;)] 
: (hay 
= }(1+2Po+P?J) = 4(1+Po) = M. 


Tn terms of the components «,, 8; the vector P is found by equating the 
elements of (4.12) and (4,13), yielding 


P, = 2Re(8, at) = sind cos 9 
(4.16) 


= 
u 


21m (8; a) = sindsin 9 
Jos|?—|B,|? = cosd. 


The unit vector P points thus in direction 9,9 and may be called the 
polarization vector of the state. The unit length of P, derived from the 
idempotence of M, reflects the total polarization of the object in a pure 
spin state. 

So far nothing has been said that cannot be expressed in terms of state 
vectors and their components. However, the quantum mechanical des- 
cription of states in terms of density matrices derives its particular 


a 
fi 


28 Concepts in Quantum Mechanics 


profundity from the possibility of giving M a meaning even if the object 
is not in a pure state, so that the decomposition of the matrix elements 
of M after the fashion (4.1) is, in principle, not possible. The density 
matrix description can grasp states which do not possess a state vector 
description. 

A model of such a situation is obtained if one imagines the object of 
measurement to consist of a statistical ensemble composed of N sub- 
systems, which may be in different quantum states. Suppose the NV 
subsystems are in a number of possible quantum states described each by 
M,, = M(b,). The average values A, of the observable A must then be 
averaged once more over the ensemble before one arrives at an expression 
for the maximum information about A obtainable in this case, namely 


(4.17) 4 = yp,A, 
with 
(4.18) LPn=1; MeO 


where p,, is the probability for finding the value A,, in the ensemble. 

The formalism enabling one to compute the average value of an attri- 
bute in a pure state using the density matrix can be extended to this 
general case by introduction of the generalized density matrix 


(4.19) M =X p,N,, 


which satisfies, because of (4.18), the condition 
(4.20) trace M = 1 
and yields the desired result 


(4.21) A= trace(MA) = ¥ p, trace(M, A) = D p, Ay. 
n n 


This generalized density matrix is no longer idempotent, however, unless 
all M,, are equal to M. Idempotence of the density matria is thus a criterion 
for the presence of a pure state. To prove this, consider a mixture of two 
systems, described by density matrices M, and M, so that 


(4.22) 
M=p,M,+p2My  (0<91< 1) (0<pe<1) (pi +p2 = 1). 
The square of this matrix 


(4.23) M? = pi Mi +p3 M3 +pipo(M,M.+M.M)) 
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can, because of the identities 

(4,24) M,M,.+M,M, = M?+M3-(M,-M,)? 
and 

(4.25) Pit pips = PilPit Ps) = P13 PET Pi Ps = Ps 


be written in the form 


(4.26) M* = p, Mj +p2,M3—p;p(M,—M,)*. 
Since M, and M, are idempotent, one has 
(4.27) M- M? = p,px(M,—M,)*. 


On the right-hand side of this equation stands a positive matrix which 
can vanish and thus lead to idempotence of M only if (M,—M,)? = 0. 
The square of a hermitean matrix can vanish only if all its elements 
vanish, and therefore M, = M, isthe necessary condition for idempotence 
of M. The general case follows now by induction. 

It may be worth noting here that in quantum statistical mechanics 
2'= —(log M) = —trace(Mlog M) is the entropy of the statistical en- 
semble represented by M. It vanishes for any pure state and only for a 
pure state. It is rather remarkable that one can bypass in this fashion 
the entire notion of “phase space,” familiar from classical statistical 
mechanics, which is not a meaningful concept in quantum mechanics, 
because momenta and coordinates are, in general, incompatible attri- 
butes. 

It should be stressed that the ensemble picture of a state which is not 
pure becomes a fictitious model when the experimental situation is such 
that optimum information about subsystems of the entire object is, in 
principle, not available. For example, consider as object a partially 
polarized beam of electrons, so that, in the ensemble model, one would 
introduce a density matrix 


(4.28) M = XP My = ZX pa(+P, 6) = 31 +Pc) 

where the average polarization 

(4.29) Pia DPrPr 

is no longer a unit vector, M not being idempotent. The absolute value 
|P| represents the degree of polarization of the beam, |P| = 0 means the 


beam is unpolarized, and |P| = 1 designates the limiting case of full 
polarization. If P is the only observable parameter of the object, the 
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decomposition (4.29) becomes fictitious, because the entire experimental 
information available resides in 

(4.30) M = }(I1+Po) 


making the mental image of a set of electrons each having definite orien- 
tation of spin, represented by the unit vectors P,,, fictitious and, from an 
operational point of view, undesirable. 


Note in this connection the general result 
6; = trace (a; M) 


trace o;+ trace P; +9; on SP = 
i 
(kA, j#K, t#j) 


(4.31) 


= ttraceP, = P;. 


NOTES 


The possibility of apprehending the state of a system by the density 
matrix was apparently first noticed by Landau [1]. Shortly afterwards, 
a pair of papers was published by von Neumann [2] employing the 
density matrix for a very complete analysis of the probability interpre- 
tation of quantum mechanics, and of the thermodynamics of quantum 
mechanical ensembles. See also Dirac [3] and Pauli [4]. 

A comprehensive review of applications of the density matrix has 
been given by Fano [5]. 

Williams [6] uses in Chapter VIII the same formalism for a particu- 
larly lucid analysis of polarization effects in scattering processes. 
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*SECTION 5 


The Theory of Selective Measurements 


The density matrix description of physical objects can be considered 
as the special case of an even more general development based upon the 
analysis of experimental situations realized by selective measurements. 

To start, introduce a symbol M(a;) representing a single selective 
measurement with an apparatus that accepts objects possessing the 
value a, of the attribute A and rejects all others. If A is understood to 
represent a complete set of compatible observables, A = A) x A@ x ..., 
then M(a;) describes a complete selective measurement, such that the 
object chosen possesses definite values, symbolized by a;, for a maximum 
number of attributes. The selective measurements M(a;) and M(a,), 
performed in either order, must in the second step result in either the 
acceptance of all objects, namely when i =k, or the rejection of all 
objects, namely when i 4 k. Symbolizing with J and O the measurements 
that respectively accept and reject all objects, the sequence of measure- 
ments can be represented by the multiplication of the corresponding 
symbols, satisfying 


(5.1) M(a;) M(a,) = M(a,) M(a,;) = 8% M (a) 
where 

I for i=k 
ce) Bu = ‘ for i # k. 


Tf one further defines the addition of such symbols M(a,), M(a,) to mean 
a less specific selective measurement, admitting all objects with any of 
the values a;, a in the summation, then the completeness of the observ- 
able A is contained in the statement 


(5.3) Y Ma) = 1. 


To accommodate the disturbance of the object by the measurement 
and the existence of incompatible sets of observables, one must consider 
the most general selective measurement performed with an apparatus 
that rejects all objects entering except those in the state b; and permits 
only objects in the state a, to leave. Symbolizing this measurement 
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process by M(a,,b;), one should stress that the envisaged process implies 
complete ignorance about what goes on between input and output stage. 
Thus M(a,,6;) is not meant to be identical with M(a,).M(b,), in particular 
the symbol M (a,,a;) is not equivalent to the symbol O, for k # j. However, 
the symbol M(a;) may be taken as the special case in which no change in 
state occurs, 


(5.4) M(a;) = M(a, a4). 


It is often convenient to represent a measurement symbol, such as 
M(a,,b;) by a graph consisting of a “black box” symbolizing the region 
of ignorance between input and output stage, which latter are symbolized 
by a directed line entering and leaving the black box, with labels indi- 
cating the quantum numbers of input and output states, as drawn in 
Fig. 5.1. 


a bj 
Fig. 5.1. Black box diagram representing the measurement symbol M(a,,b,). 


To be specific, let +s, ss, +sg denote the possible values of spin of 
an object in directions x, y, z, respectively, so that, for example, —s, 
means “‘spin —4 in direction x.” The measurement M(+<s3, —s,), for 
example, can be realized by an experimental arrangement which is 
rendered graphically in Fig. 5.2, with the corresponding black box 
diagram drawn underneath. Everything has been drawn in one plane, 
omitting unimportant beam deflections, the magnets labeled according to 
the alignment of their magnetic fields along the respective axes. Each in- 
put and output stage corresponds to aselective Stern-Gerlach experiment, 
and the region of ignorance in-between represents the fate of the beam 
between experiments. 

The compound measurement M(q,,,6;).M(c;,,d;) admits objects in the 
state d; and permits them to enter into the state a,,, and is therefore a 
selective measurement of the type M(a,,,d;). If C and B are incompatible, 
only a certain statistically predictable fraction of the objects leaving the 
stage M(c,,d;) will be admitted into the stage M(a,,,b,) of the apparatus. 
Hence one has the multiplication rule 


(5.5) M (dm, b;) M (cy, d;) = <b;\e¢> M(am, d;) 


where ¢6;|¢,,> is a number which serves to express the statistical relation- 
ship between the states b; and ¢,. It should be stressed that the notation 
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Z / Yj 


+53 ~S) 


Fic, 5.2, Experimental arrangement corresponding to the black box diagram represent- 
ing the measurement symbol M(+<3, —81). 


for this number is purely conventional and does not, at this stage, imply 
anything about a possible interpretation in terms of geometry in an 
abstract space. 

The value of ¢b;|c,> can be ascertained immediately for the special case 
of compatible observables, because then one must have 


(5.6) Ma, b;) M(b,,¢;) = 8. M (An; Cj). 
Indeed, if b; 4 b,, then the second stage of the compound apparatus will 


accept none of the objects emerging from the first stage, while, if b; =b,, 
all such objects are admitted into the second stage, thus 


(5.7) <bj|bu> = Bix. 

Further special examples of (5.5) are 

(5.8) Maj) M(by, ¢;) = <aj\b,> (aj, ¢;) 
(5.9) May, by) M(c;) = <bg|e,> Maj, ¢). 

If the sequence of the stages in (5.5) is reversed, one has 
(5.10) M (cy, d;) M(am, bj) = di|n> M (cy, b;) 


which is, in general, different from (5.5). The multiplication of measure- 
ment symbols is, in general, noncommutative. From the completeness 
relation (5.3) and from (5.8) and (5.9) follows 


(5.11) HM (aj) M(by,¢%) = M(b,, ¢;) = D2) <aj|b,.> M(a;,¢;) 
d 
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(5.12) x M(a;,b,) M(¢;) = M(aj, by) = YX <byle:> M (ay, ¢;). 


Thus measurement symbols of one type can be expressed as linear 
combinations of another type. By repetition of this expansion one has 
generally 


(5.13) 

M(c,,d;) = ~ x M(a,) M (cy, d;) M(b;) =~ a (Gm|cy> <d;\b;> M (am, b;). 
mj 

Because of its ability to sponsor such connections, the set of numbers 


<a,|b;> is called the “transformation function relating the A- and 
B-description.” 


+53 +5; -S; -s; 


Fic. 5.3. Black box diagram representing the measurement M(-+83)M(—é2, —61). 


be -s) +33 +53 
Fic. 5.4. Black box diagram representing the measurement M(—s2, —81)M(+83). 


As an example involving compound selective Stern-Gerlach experi- 
ments, consider the measurement M(+8s)M(—s:, —s,), rendered 
graphically in Fig. 5.3. Equation (5.8) reads for this case 


(5.8a) M( +83) M(—s82, —81) = ¢+83|—82> M(+83, —81) 


and says that this measurement corresponds but to one channel which 
contributes with amplitude ¢ +8] —s,> to the measurement M( + 83,—8,) 
which has already been rendered graphically in Fig. 5.2. 

Carrying out the compound selective measurements of Fig. 5.3. in 
opposite order, one has a quite different situation, namely the measure- 
ment M(—s,, —s,)M(+8 3), rendered in Fig. 5.4, Equation (5.9) is now 
appropriate, 


(5.9a) M(—8, —8;) M(+83) = <—8,|+83> M(—s2, +83) 
and gives ¢ —s,| +83) as the amplitude with which the particular channel 
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contributes to the measurement M(—s», +83), which is different from 
the one depicted in Fig. 5.2. 
Transformation functions have the composition property 


(5.14) x aj|b,> <dy|ei> = <ajle;>. 
This follows from 


(5.15) x <aj|b,> <b, |e;> M (ay, ¢;) = x M(a;) M(b,) M(c;) = M(a;) M(c;) 


= <a,|c¢;> M(a,;, ¢;). 


Any linear combination of measurement symbols will be called an 
“operator,” and denoted by capital letters X, Y, .... Writing 


(5.16) i= x z <a,|X|b;> M (ay, b;) 

the expansion coefficients <a,|X|b;> can be arranged in an N x NV matrix 
scheme, and will thus be called the “matrix elements” of X. Operators 
are accordingly elements of a linear algebra of dimensionality N*, 
where N is the number of different values a; attributable to a complete 
set of observables A. In this algebra the number <a,|b,> can be regarded 
as a linear numerical function of the operator M(b,,a;), which will be 


called, in anticipation of an obvious matrix representation, the “trace” 
of M(b,, 4;), 


(5.17) <a;|b,.> = trace M(b,, a,). 
This definition is consistent, because the linear relation (5.13) leads to 
(5.18) trace M(c,,d;) = LY <am|cy> <d;|b;> trace M(a,,, b;) 
mi 
=z = <m|¢4> <di]B;> <bj|4m> 
= ¥ «ails <bileg> = <dilen>. 
As special cases one obtains 
(5.19) trace M(a,,a;) = dy 
(5.20) trace M(a;) = I. 
The trace of a product of two measurement symbols is 
(5.21) trace [M(a,,, b;) M(c;,,d;)] = <bj|¢,> trace M (ap, d;) 
= (j\¢,><di|am> 
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and one has also 
(5.22) trace [M (cy, d;) M(am,b;)] = <d;|am> trace M(c,, b;) 
= di|an> <bj|e4>- 
Hence the trace of a product of two operators is independent of the order 
of multiplication, 
(5.28) trace (X Y) = trace (YX) 


even though the operators themselves do not commute. A special case 
of (5.21) is 


(5.24) trace [M(a;) M(b,)] = <bg|a;> <ai>. 


The general multiplication law (5.5) of measurement symbols and the 
definition (5.17) of the trace are obviously invariant under the trans- 
formations 
(5.25) M (ay, b;) > y-*(Ay) M (ay, b;) y(0;) 

<q,|b;> > y(ay) <4 |b;> y-*(0;) 
where the numbers y(a,) and y(b;) can have arbitrary nonzero values. 
The quantities M(a;) and <a;|a,> = 5, are seen to be invariant, too. 

Contact with the statistical interpretation of quantum mechanics 
developed in the preceding sections can now be established most naturally 
by considering the sequence of measurements M(b;) M(a;,).M(b;) which 
differs from M(b;) because of the disturbance produced by the measure- 
ment of A in the step M(a,). Only a fraction of the objects admitted in 
the first stage M(b,) is transmitted through the last stage M(b,) of the 
apparatus. In the equation 


(5.26) M(b;) M (ay) 1 (b;) = play, b;) M(b;) 
the number 
(5.27) P(A, b;) = <ay|b;> <b;|q,> 


is, in contrast to (a,|b;>, invariant under the transformation (5.25) and 
has the additive property 


(5.28) M1 (b;) LA (a,) + M(a;)] M(b;) = [p(ap, b;) + pj, 0;)] 0 (;) 
so that from 


(6.29) M()[¥ a1la)].6) = a0) 


follows 
(5.30) EP, 6) = 1. 
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Thus p(a,,0;) is qualified to serve as the probability for observation of 
a, if the object is known to be in the state b;. Since a probability must 
be a real non-negative number, the numbers <a,|b;> will be subjected to 
the admissible restriction 


(5.31) <bilag> = <a,|bi>* 
because then 
(5.32) P44, b;) = |<ay|;>|® > 0. 


The condition (5.31) imposes on the numbers y(a,), y(b;) in (5.25) the 
restriction 


(5.33) v*(de) = Ma); 
they must therefore be of the form 
(5.34) (ay) = ehean) 


with arbitrary real phases (a,). 
From the definitions (5.16) and (5.17) of operator and trace follows an 
expression for the matrix elements of an operator, 


(5.35) <q,|X|b;> = trace [X M(b;, a,)]. 

Indeed, 

(5.36) trace [XM(b;,a,)] = YY am|X|b;> trace [M (am, b;) M(b;, ay)] 
jm 


= LLY An |X|b;> (5;|b;> trace M(ap, ay) 
J ™m 
= z m| X|B;> <u |@m> = <ay|X|b;>. 


The expression for the average value of an operator X in the state b; 
follows from (5.35) by specialization, 

(5.37) X = <b|X|b> = trace [XM(b;)]. 

The measurement symbol M(b,) is thus recognized as being identical 
with the density matrix J1/(b;) introduced in the preceding section. The 


justification for denoting (5.37) as an average stems from the probability 
definition 


(5.38) P(A, b;) = trace [M(a,) M(b;)] 
which allows one to write the average value of the observable 4, namely 


(5.39) A, = E 4% p(n, bi) 
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in the form 
(5.40) A,, = ¥ trace [a, M(a,) M(b;)] = trace [AM(b,)] 
k 
provided 
(5.41) A= Ya,M (4). 
k 


In summary, the accompanying tabulation gives the alternative 
description of physically accessible quantities in the state vector and in 
the measurement symbol picture. 


State vector picture Measurement symbol picture 
(by)e(ai)e or |b.><a M,,(bpy 44) OF M(bpy 44) 
(bx)elba)t M,,(b,) 
<a,|b,> trace M(b,,a,) 
<ayla> = dx trace M(aj,a,) = dx 
[<aglbi>|? = P(e 4) trace [M(b,) M(a,)] = play i) 
<a|X[B,> trace [XM (bj, a,)] 
<b|X|b> = X trace [XM(b,)] = X 

NOTE 


Sections 5 and 6 follow closely a development by Schwinger [1]. 
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“SECTION 6 


The Representation of Nonselective Measurements 


The measurement symbols M(a;) = M(a;,a;) considered in the preced- 
ing section involved an apparatus which, in addition to separating the 
ensemble of objects into subensembles belonging to various states @,, 
dz, ..., Selected one subensemble a; and rejected all others. 

It is, of course, possible to have an apparatus which performs a non- 
selective measurement by performing only the separation into sub- 
ensembles without the selecting stage. If the separation takes place with 
respect to the observable B, the corresponding measurement symbol will 
be denoted M, and rendered graphically as in Fig. 6.1. To obtain a 


Fic. 6.1. Black box diagram representing the nonselective measurement symbol M,. 


representation of M, in terms of selective measurement symbols M(b,), 
consider an object and then subject it, in succession, first to a selective 
measurement M(b,,c;) and then to a selective measurement M(a;,b,) as 


aj Ea. gy «te 


Fig. 6.2. Black box diagram representing the measurement M(a,,b,)M(b,,¢;). 


rendered in Fig. 6.2. The probability that the object will exhibit the 
value , of B and then the value a; of A is 
(6.1) PUG, By, C;) = (Aj, by) P(g, Ci) = |<a5]Dg> CO, |e,>|? 


= |<aj|M(b,)|e;>|? 
39 
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where use has been made of (5.35) and (5.9), giving 

(6.2) <aj|M(b,)|¢;> = trace [M (¢;, a;) M(b,)] = <a;|bj,> trace M(c¢;, by) 
<aj|bj,> dy |e;>- 

Now suppose the intermediate measurement of B were not made at all, 


so that the intermediate measurement symbol in the matrix element 
(6.2) can be replaced by the identity operation J = } M(b,), correspond- 
% 


ji i 


Fic. 6.3. Black box diagram representing the measurement M(a,,¢;). 


ing to the process rendered graphically in Fig. 6.3, then the correspond- 
ing probability is 


(6.3) P(aj,1,¢;) = |<ajle;>|? = [= <aj| IL (b,)|e4>|?- 


One has in this case of “coherent”? B subensembles an addition of 
probability amplitudes, giving rise to “interference” effects, the terms 
“eoherent” and “interference” being coined because of the profound 
analogy of (6.3) with the description of interfering coherent light rays 
for which there is an addition of amplitudes. 

On the other hand, if, in the intermediate stage, the B-separation 
apparatus is turned on, but without the selecting stage, as rendered 


aj by & 


Fia. 6.4. Black box diagram representing measurement with intermediate B-separation 
apparatus turned on. 


graphically in Fig. 6.4, then the probability for finding a;, if ¢; is the 
initial state, is given by 


(6.4) p(a;,6,¢;) = E (45, bus 6) = z |<a;| JZ (b,)|e:>|*. 
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Thus, in this mixture of incoherent B subensembles one has an addition 
of probabilities themselves, the difference from (6.3) being the absence 
of interference terms between the different states b,, corresponding to 
the addition of intensities in the optical analog. 

The symbol associated with the nonselective B measurement may 
therefore be taken to be 


(6.5) My = Det M(by) 


with randomly distributed phases 4(b,), which express the uncontrollable 
nature of the disturbance produced by the nonselective measurement. 
The probability (6.4) accordingly takes a form similar to (6.1), 


(6.6) P(4;, 6, ¢;) = |<a,| M,|e;>|?. 
Since the nonselective measurement does not reject objects, one must 
have 
(6.7) Y Play, b,c) =2 <e;| MP |a;> <a;|M,|e;> = <o,| My M,le;> = 1 
d 


which means that the operators M, are unitary, 

(6.8) Mj M, = M, Mz = I. 

It should be noted that the selective measurement symbol M(b,) can be 
obtained from the nonselective symbol M, according to (6.5) if all but 
one of the phases are replaced by positive infinite imaginary numbers, 
corresponding to an absorption of all but one of the subensembles 
produced by M,. 

As an example, identify the states ¢;, b,, and a; as follows. 


|c;> = (state with spin +1 in direction z) = (i) 


be fs . rales ‘ _ { cos (8/2) 
a |b1> = (state with spin +1 in direction 9,9) = i (8/2) a) 
; = yee sates g ) _ [Sin (B/2) 6 
|b2> = (state with spin — 1 in direction 3, 9) = ( cos (8/2) ) 
|a;> = (state with spin +1 in direction x) = vali) 


so that the density matrices corresponding to the measurements of b, 
and b, are 
MBN fees (8/2) sin ()/2) cos aga 
: sin (9/2) cos (9/2)e'? sin? (9/2) 
‘sin? (9/2) —sin (}/2) cos (9/2) e*? 
—sin (9/2) cos (9/2) e"? cos? (/2) ) 


(6.10) 
M(b.) = ( 
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satisfying obviously 
0 
(6.11) M(b;) +M (bo) = (i i) Ts 
The matrix elements relevant to the probability (6.1) are then 
: ies cos” (9/2) 
(6.12) <a;|M(b,)|e> = Re ie 
= (1/*/2) cos (8/2) [cos (/2) +sin (9/2) e'?] 
_ 1 = (sin? (0/2) 
(6.13) <ay].B(bs) I> = 5h i (mein (#/2) cos winee 


(1/12) sin (0/2) [sin (9/2) — cos (6/2) e'#]. 


Thus, if no measurement of spin in direction #,¢ is made between the 
selective measurements of spin +1 in direction z and in direction x, the 
probability is 


(6.14) p(a;,1,c;) = |<a;le,>|? = | x <a;|-M(b,)\c;>|° = 4, 
whereas turning on the separating magnetic field in direction 0,9 
without selection in the intermediate stage results in 
(6.15) 
P(4;,6,¢;) = z |<ayj| M (bx) |e>|? 
= cos? (f/2) (1+sin# cos 9) +4sin? (9/2) (1—sin# cos 9) 
= 4(1+sindcos# cos 9). 


NOTE 


The reader’s attention is drawn to a most interesting paper by 
Albertson [1] in which a careful analysis of the measurement process 
itself is given, by consistently including the measuring instrument into 
the quantum mechanical description without assuming knowledge of the 
premeasurement state of the instrument. 


REFERENCE 


[1] J. Albertson, Quantum mechanical measurement operator, Phys. Rev. 129, 940 
(1963). 
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The Fundamental Dynamical Postulate 


The aim of any dynamics is to predict the outcome of experiments. 
The dynamical equations describing any physical object should enable 
one to compute the outcome of a complete measurement at time t, if the 
outcome of this measurement at time ty is known, and provided no other 
observations of the object are made in between the times ¢ and ty which 
may alter the initial state established by observation at time to. 

All physical objects which can be grasped with the language of 
quantum mechanics to date seem to satisfy the following fundamental 
postulate. 

For any physical object there exists a hermitean operator H = H* so 
that the expectation value of any observable A satisfies the equation 
(7.1) A = i(HA—AB) + (04/@) 

= i<b|HA— AH|by + <b|0A/t|b> 

= itrace [(HA—AH) M]+ trace [(0A/dt) M]. 
This formulation is clearly invariant under unitary transformations. 
The operator H is called the Hamiltonian of the object, because in many 
cases it happens to be identical with the corresponding Hamiltonian in 
the classical description of the object, provided the canonical variables 
on which H depends are replaced by suitable operators. In fact, it was 
this correspondence with classical mechanics which first led to the 
discovery that postulate (7.1) constitutes a quantum mechanical 
description of the dynamics of some objects. 

It must be borne in mind, however, that there are many physical 
objects for which there exists no classical analog, and the construction 
of a suitable Hamiltonian is often the most difficult problem encountered 
when description of an actual physical object is attempted. The lack of 
an unambiguous recipe for finding the Hamiltonian of an object is one 
of the major shortcomings of contemporary quantum mechanics. 

The rate of change with time of the expectation value A of an observ- 
able A may be written, depending on whether A is given in the form 
A = <b|A|b> or in the form A = trace(A UM), either as 


(7.2) A = ¢b|A|b> + <6] [b> + <b] 46> 
43 
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or as 
(7.3) A = trace (4M) + trace (AM). 


Now there are several ways in which one may picture the evolution of 
a physical object described by Eq. (7.1). The following three points of 
view or pictures which may be attached to the representation (7.2) are 
particularly useful. 


(I) The State Picture. Equation (7.1) is satisfied by assuming 


(7.4) A, = 0A,/dt 
so that the evolution of the state is governed by the equation 
(7.5) i|6>, = H|b>,. 


This means in this state picture the state of the object at time t is defined 
by the outcome of a complete measurement on the object at that time. 
All observables are represented by the same operators for all times. 
Equation (7.5) is widely known under the name of Schroedinger's 
equation. The state picture is accordingly often called the Schroedinger 
picture. It should be noted that with respect to time this equation con- 
tains the first derivative only. Depending on the precise form of H it 
resembles in some cases a diffusion equation with imaginary diffusion 
constant. 
(II) The Operator Picture. Equation (7.1) is satisfied by assuming 


(7.6) |d>o = 0 
so that the evolution of the object is governed by the operator equation 
(7.7) tAy = Ay H—HAy+i(0A)/@t). 


This means in this operator picture the state of the object is fixed as a 
vector defined by the outcome of all possible complete measurements on 
the object throughout its history. It is the representation of observables 
which now varies with time according to Eq. (7.7). This picture is widely 
known as the Heisenberg picture. 

One should be careful about the symbols used to denote differentiation 
with respect to time. It is always understood that A =aA)/at; 
|b> = @|b>/dt. Equation (7.7), governing the operator picture, should, 
strictly speaking, be written i(DA /Dt) = Ay) H —HA,+i(0Ao/dt) where 
DA)/Dt stands for an operator which has the expectation value 
(DA,/Dt) = 0(A)/ét. For the Hamiltonian itself one has thus quite 
generally DH/Dt = 0H /@t. Use of the symbol d/dt for differentiation with 
respect to time may lead to misunderstandings, because |b> depends in 
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general on other variables, e.g. the position q of the object, which may 
also be a function of time, and in (7.5) and (7.6) the differentiation |b» 
does not include differentiation with respect to the implicit time depen- 
dence of |b»> via the dependence on q(t), say. 

(III) The Interaction Picture, If the Hamiltonian of the object can be 
meaningfully split into a ‘free’ Hamiltonian H°, which is time inde- 
pendent, and an “interaction” Hamiltonian H’, which may depend on 
time, 


(7.8) H = H°+H’ 

then it is sometimes convenient to satisfy Eq. (7.1) by assuming 
(7.9) iA, = A,H°—H°A,+i(0A,/dt) 

59 that 

(7.10) i|b>; = H'|b»,. 


This means in this interaction picture the evolution of the state is 
determined by the interaction Hamiltonian alone, provided the operators 
representing observables are made to change with time according to 
Eq. (7.9) which is governed entirely by the free Hamiltonian. 


The choice of picture employed in the solution of any problem is 
dictated solely by convenience. The invariance of the fundamental 
dynamical postulate under changes in picture must correspond to the 
invariance of (7.1) under certain unitary transformations which connect 
the state vectors |b>,, |b>9, |b>; and the operators A,, Ay, A, of the 
various pictures which describe the same physical situation. 

As a first example consider the unitary transformation connecting the 
state picture with the operator picture, 


(7.11) |b>, = U|b>o; A, = UA,U*; UUt = UtU =I. 
Since |b) is, according to Eq. (7.6), constant in time, it follows that 
(7.12) |b>, = U|byo. 

On the other hand, Eq. (7.5) yields 

(7.13) |), = —iH|b>, = —iHU|b), 

and one obtains for the transformation operator U the equation 
(7.14) iU = HU 


and by a similar argument applied to the inverse transformation 
U-1 = Ut+ 
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(7.15) —iU+ = UtH. 


If H is given and does not contain the time explicitly, then Eq. (7.14) can 
be solved formally immediately, to yield 


(7.16) U(t) = e#*U(0). 


Tf one adopts the initial condition U(0) = J, so that |b(0)>, = |b>o, one 
may write the first Eq. (7.11) 


(7.17) |b(t)>, = e*¥* |b(0)>, 


meaning the Hamiltonian is the generator of a unitary transformation 
which develops, in the state picture, the state vector in time. This is the 
quantum mechanical analog of the well-known result of classical 
mechanics, where the Hamiltonian is the generator of a canonical 
transformation which develops the system motion in phase space. The 
second equation (7.11) may now be written, because of the initial 
condition A (0) = A, for the case of operators which do not contain the 
time t explicitly, 


(7.18) Ao(t) = e#* Ag(0) et## 


and the Hamiltonian is thus also recognized as the generator of a unitary 
transformation which develops, in the operator picture, the operators in 
time. One can thus, in the Schroedinger picture, 1egard the state vectors 
as rotating and the operators with their eigenvectors as standing still, 
and, in the Heisenberg picture, regard the state vectors as standing still 
and the operators with their eigenvectors as rotating, the sense of 
rotation in the abstract space being opposite in the two cases. 
Tf |b(0)>, is an eigenfunction of H with eigenvalue w, then 


|b(t)>, = e* |b(0)>,. 
Knowledge of the solution of the eigenvalue problem 
(7.19) H\b> = w|b> 


is therefore, in principle, of importance. Unfortunately, the number of 
cases for which the eigenstates and eigenvalues of the Hamiltonian are 
known are very limited. The process of finding these solutions has often 
the character of a mathematical stunt. 

The central mathematical problem remains the solution of the system 
of Eqs. (7.14). The main concern will be with 

(a) those cases that can be solved easily for given H, 

(b) the ramifications of those cases that cannot be solved in closed 
form for given H, and 

(c) attempts at guessing solutions in cases for which at most some 
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general symmetry properties of an otherwise unknown Hamiltonian are 
known. 


As a very simple example consider the dynamics of a spin s, connected 
with the magnetic moment w= (e/m)s of the electron, in a constant 
external field B. The Hamiltonian giving the correct quantum mechanical 
description of this object is simply the interaction energy 


(7.20) H = —(pB) = —(e/2m)(oB) 


written in terms of the spin operator o introduced in Section 2, the 
components of B being treated as given parameters. 

Working in a coordinate system in which B has a z component only, 
and taking as basic state vectors the eigenvectors of og, the fundamental 
equation in the state picture (7.5) reads explicitly 
(7.21) 


(G)- wna) ~-wama( 3 5 


and has the solution 


cos (9/2) 
sin (9/2) e'? 


i 


(7.22) a(t) = a(Oerut; —B(t) = B(0) e~taut, 
with the characteristic frequency 
(7.23) wy, = (eBs/2m) 


so that in terms of the polar angles in accordance with (2.41) 

(7.24) (B/a:) = [B(0)/a(0)]e*4* = tan [9(0)/2] exp t[ (0) —2w, £]. 

This means the expectation value of the spin which at time t = 0 points 
in direction #(0), 9(0) precesses around the direction of the external field 
with frequency 2w,. 

In the operator picture the same situation is described by the funda- 
mental equation (7.7) which reads in this example for each component 
of the spin, using (2.46), 

i6, = 0, H—Ho, = —w;(0,03—030,) = 2iws os 
(7.25) tidy = 02H —Hoy = —w,(0,03—0302) = —2iw,o, 
ios = 0, H—Ho, = 0 
being a special case of the general equation 
(7.26) 6 = 2(w,xo) 


which again means the expectation value of the observable 6 precesses 
around the direction of the external field B with angular velocity 
2w, = (e/m)B. This can be verified, by explicit solution, as follows. 
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The special case (7.25) has, according to (7.18), the solution 


(7.27) a(t) = entexoat g,(0) etiwzast 
(7.28) ao(t) = eMx9st go(0) etionost 
(7.29) ag(t) = eMxt g9(0) eter" = o5(0). 


Equation (7.29) allows one to replace in the exponents of (7.27) and (7.28) 
the operator a; by the constant operator o3(0), and one has the expan- 
sions, with the notation [AB] = AB-— BA, 
(7.27') oy (t) = 04(0) — (iz t)/Ufog oro + (tz #)*/2!Loslo3 or Iho 

— (iw, t)*/3'[ogloglog or] ]h-0 +--+ 
(7.28')  ag(t) = o2(0)— (iz t)/I [os o2}-0 + (’mzt)*/2[oslo3 o2]}—0 


— (tz t)?/3![osloslog oe) 0 +--+ 
The C.R.s yield 


(7.30) 
[e301] = 2iog; [eslo3ox]] = — (2%)? 04; (esloslosox]]] = 
= —(2t)8o9;... 
(7.31) 
[e302] = —2io,; [eslose2]] = — (21)? 9; [oslos[os o2]}] 
= (2t)8o43... 


so that finally 

(7.27") 

oy(t) = o4(0){1 —[(2az, t)?/21]+...}+.02(0){(2e,, t) — [(2wz t)3/3!]+...} 
= 0,(0) cos (2w,,t) + o9(0) sin (2wy, t) 

(7.28”) 

oa(t) = o9(0){1 —[(2ezt)?/21] +...}—01(0){(2ez #) — [(2w, t)9/3!]+...} 
= a9(0) cos (2wz,t) — 01 (0) sin (2w;,¢). 

The unitary operator connecting state picture and operator picture is 

(7.32) U(t) = exp [iw(,o-n) t]Z; n = B/|B| 

which by expansion and utilization of the C.R.s can be written 

(7.33) U(t) = cos (wz t) +i(o-n)sin (wt) 


and represents a rotation by angle 2w,¢ around n, in accordance with the 
remark made at the end of Section 3. This will be taken up in a more 
general context in Section 12, 
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It is instructive to write the development of an object in time in 
terms of the density matrix representation (7.3). From this and the 
fundamental dynamical postulate (7.1) follows the identity 


(7.34) 

trace (AM)+ trace (AM) = itrace[(HA—AH) M]+trace [(0A/at) M1]. 
In the state picture the condition (7.4) leads immediately to 
(7.35) 

trace (A, M,) = itrace[(HA,—A,H) M,] = itrace [4,(M,H-HM,)] 
which is satisfied provided 
(7.36) M, = i(M,H—HM,). 
It must be stressed that M, is a substitute for the state vector and does, 
therefore, depend explicitly on time, as does |b>,, in the state picture. 
The symbol denoting differentiation with respect to time of the density 
matrix means, as in case of the same symbol for the corresponding state 
vector, the partial derivative 
(7.37) M, = 0M,/at 
and one can thus write (7.36) in formal analogy to the concept of total 
differentiation 
(7.38) DM,/Dt = i(HM,—M,H)+(0M,/at) = 0 


which is the quantum mechanical analog of Liouville’s theorem in 
classical mechanics. This equation can be verified explicitly for a pure 
state by using the Schroedinger equation. If the decomposition 


(7.39) (Mix = (b)(b)E 


is adopted, one obtains by partial differentiation with respect to time, 
using (7.5) 


(7.40) (M)ix 


(b)(b)E + (0) (B)E = i x {Hy(b)(b)f — (b)(b)# Hix} 
-i(HM— MH), 


which is identical with (7.36) in components. 
The transition to the operator picture is effected by satisfying (7.34) 
with (7.7), requiring 


(7.41) My =0. 


This can be verified directly by applying to (7.39) the unitary trans- 
formation connecting M,(t) with M,(0). From (7.17) follows 


(7.42) M,(t) = |B(t)>, <b), = € |b(0), <b(0) |," = oe" M,(0) ef", 
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Now, M,(0) can be identified with the constant density matrix in the 
operator picture, M,(0) = Mo, because if an operator A, is transformed 
into Ay by : 


(7.48) A, = UA,Ut+ 
then the invariance of 
(7.44) A = trace(A,M,) = trace(UA, Ut M,) 
= trace(A,U+ M,U) = trace (Ay Mo) 
requires M to transform accordingly, 
(7.45) M, = U+M,U 
which is Eq. (7.42) with U(t) = eI. 
It is interesting to note that if the eigenvectors of H, 
(7.46) H\a;> = w;\a,> (say) 
form a complete set and can thus be used as a basis, then the matrix 
elements of (7.42) take the form 
(7.47) <a;|M(t)\a,> = My (t) = Myz(0) ror 
so that only energy differences, but no unobservable absolute energies, 
appear. 


Writing for the special example of the spin magnetic moment in a 
fixed external field the density matrix as in (4.30) 


(7.48) M, = 3[1+(Po)], 


where now P(t) is the time dependent polarization vector characterizing 
the spin, then the fundamental equation (7.38) reads, with 


H = ~(e/2m) (0B), 


(7.49) — (ie/2m) [(6B) (Pa) — (Pa) (6B)] + (AP/at)-o = 0. 
Now, by virtue of the C.R.s of the matrices o; one has 
(7.50) (6B) (Ps) — (Pa) (6B) = 2i[o-(B x P)] 

so that (7.49) can be written 

(7.51), [(@P/at) + (e/m) (Bx P)]-o = 0 

which requires as necessary condition the classical equation 
(7.52) (OP/dt) = —(e/m) (Bx P) 


describing again the precession of the polarization vector around the 
applied field with frequency 2w, = (e/m) B. 
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The visualization of a spin by means of a precessing polarization 
vector is thus by all means permissible and does not lead to contradic- 
tions with the probability aspect of the Stern—Gerlach experiment. 

As an example to which the interaction picture may, be applied, 
consider a spin magnetic moment p= ys subject to the external field 


(7.53) B = By+B, = Bok +B,{icos (wt) —jsin (wt)] 


corresponding to a fixed field Bo in z direction and a field B, rotating 
clockwise with angular velocity 


(7.54) wo = —uk 
in the (x,y) plane. The Hamiltonian can be written 
(7.55) H = ~»(sB) = —y(sBy) —y(sB,) = H°+H’. 


Tn the interaction picture, the equations governing the operators are 
therefore identical with (7.27), (7.28), and (7.29) and one has for the spin 
operators the representations 


ox(t) = 01(0) cos (wot) +02(0) sin (wot) 
(7.56) o9(t) = —o,(0)sin (wot) + o9(0) cos (wot) Wy = 2w, = yBo 
a(t) = o3(0) 


and the state vector in this picture is governed by the equation 


i|6>, = H'|b>, = —(y/2) (6B,)|b>, 
= —(#;/2) [01 (¢) cos (wt) — o(t) sin (wt)]|b>, 
en) = —(w/2) {o}(0) cos [(wy —w) é] 
+49(0) sin [(wo—w) t]}b>75 
ow, = yBy; 


which reads in components 


bra(s mG m= ( a) 
& = (iw, /2) eHor-w)t B 
{2 = (iea,/2) etorort gy, 
By differentiation of the first of these equations, and substitution of 8 
from the second, one eliminates 8 and obtains 


(7.57') 


(7.58) —&—1i(wo—w) &—(w 4/2)? = 0 
which may be solved by 

(7.59) a(t) = a(0) et/2Vt 
provided 


(7.60) 2? + 2(wy—w) Q—wit = 0 
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giving rise to two solutions 


(7.61) Q, = —(wy—w)+V (wy — a)? +0}; 
Qy = —(wo—w)—V (wo —w)? +0} 

so that the general solution of (7.58) reads 

(7.62) c(t) = 0¢4(0) ef Al20t + o9(0) etiSu/204 

from which one obtains then 

(7.63) 


Blt) = (2/tcas) Mert, = ox4(0) (Qy]er1) CUM! +. o¢9(0) (Qy/eu) eM, 
Now suppose at time ¢ = 0 the spin has with certainty the value +1 
in z direction, i.e. |b(0)> = |b,> = (), then 
(7.64) 
(0) = 04(0)+a2(0) = 1; B(O) = (0) (21/1) + 20) (@e/w1) = 0 
giving the initial values 
(7.65) (0) = 23/(Q,—2)); (0) = 24 /(Q,—2,). 


One can now calculate the probability for ‘spin flip,” i.e. the probability 
for finding at time ¢ the state |b_> = ({) in which the spin has the value 
—1inz direction, namely, 

2 e\ 2 2 
(7.68) Pag |KBL[E[F = [0 Ap = IBW*- 

ee 

By substitution of (7.65) into (7.63) one obtains thus, using 2; Q. = — wi 
and 2,;—Q, = 2V/ (wy — ow)? +wi, 


202 Q3 ton eien 
on Puy = aoa °°(—3 4) 
ot in2(Vlo-o a8, 
wi +(wo—w)* 2 7 


The amplitude of the spin flip probability shows thus a resonance at 
w = wo, which can be exploited to measure w, and thus y. 

By an argument similar to that leading from (7.34) to (7.36), the 
density matrix in the interaction picture is found to satisfy the equation 


(7.68) My, = i(M,H'—H' M;) 


from which one obtains by straightforward computation after sub- 
stitution of M, = {1 +(Poa;)] 
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Py = —woP2+o,Pysin (wt) 
(7.69) Py = +a9P,—w,P, 008 (wt) 
Ps = —w,P,sin (wt) +, P2 cos (wt). 


These equations are, of course, identical with (7.52) if B is given by (7.53). 
Their solution proceeds without difficulty. Differentiating P, twice with 
respect to ¢ and using the expressions for P, and P, one obtains 


(7.70) P,=-w*P,; with w= V(w—a)* +08 
and thus the solution 
(7.71) P(t) = Ae™+ Bem +0 


in which the integration constants A, B, C are to be determined from the 
initial conditions, which in the example treated above are 


(7.72) Px) =1; — P,(0) = P,(0) = 0 
leading immediately to 

(7.73) A= B = wi/Qw’*; C = (wo—w)?/w? 
and the solution (7.71) reads in this case 

(7.74) P(t) = (1/w*) [wi cos (wt) + (wy —w)*]. 


To obtain Py, by this method, one need not calculate P,(t) and P,(t). 
The density matrix is generally 


ae 
(7.75) M(t) = H+ (@P)] = eee oe) 


and the probability for finding the spin in any given direction #), 9 is 
P5,. = trace[M(), p) M(t)]. For spin —1 in z direction one has 


M(9,¢9) = M(b-) = (01), 
and therefore 


ae 0 0\/ 14+Ps P,—tP;\] _ 
(7.76) Paiy = Herace[( ee iii )| = $(1—P3) 


= (wi/w?) sin? (wt/2) 


in agreement with (7.67). 
The connection among interaction picture, state picture, and operator 


picture is established by unitary operators defined by 
(7.77) |b>, = Vb>y |d>; = W|b>o 
: VVF a WtV =f; WWt = WW =7 


so that 
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(7.78) |b>, = VW|b>o ie. |b>, = Uld>o with U= VW. 
Since one has 

(7.79) |b), = V[d>,+V|Byr = (V-iVH')[>y; [dr = Wldyo 

and also 


(7.80) 
|, = —iH|b>, = -tHV|b>;; [b> = —iH’|b>; = —iH’ W|byo 


it follows that V and W must satisfy the operator equations 


(7.81) V = i(VH'-HY); W = -iH'W. 
This is consistent with (7.78) because 
(7.82) 


U = VW+VW = i(VH’—HV)W-iVH'W iHVW iHU 


is identical with Kq. (7.14). 
Explicit solutions can be found for V and W, for the case of spin 
magnetic resonance treated above, without difficulty. 


NOTES 


The state picture and the operator picture of quantum mechanical 
dynamics emerged in famous papers by Schroedinger, beginning with a 
short note [1] and collected in book form [2], and by Heisenberg [3], 
followed by papers in collaboration with Born and Jordan, whose 
book [4] is entirely devoted to development of the state picture. The 
unitary transformation connecting the two pictures was first given by 
Schroedinger [5]. 

The interaction picture, although it is implicit in many earlier works, 
did come into its own right through development by Tomonaga [6]. 

Rabi [7] first gave the exact solution of the magnetic resonance 
problem for the case of a spin 4. 
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SECTION 8 


The Representation of Observables with Nondenumerably 
and Denumerably Infinite Ranges of Possible Values 


To grasp attributes capable of a nondenumerably infinite or con- 
tinuous range of possible values it is desirable to have a description which 
is formally identical with the one developed for discrete quantum 
numbers in Sections 1 and 2. Thus a state depending on a continuous 
attribute q will be denoted |q> and will be represented by a complex unit 
vector in an abstract space (“Hilbert space’) of nondenumerably 
infinite dimensions. Formally, one may thus expect |q> to be an eigen- 
vector of a suitably defined operator Q@ with eigenvalue q in analogy to 
the vector |a;> in a finite dimensional space, which is an eigenvector to 
the operator A with eigenvalue a,;. The correspondence 


(8.1) Ala;> = aja,» > Q|a> = a|a> 
can be extended to all operations defined for the vector |a;> in Sections 


1 and 2, provided one can formally introduce in the Hilbert space the 
notion of orthogonality by the correspondence 


(8.2) <ai|a> = du <q'|7"> = 8(7'-g") 
and the notion of closure by the correspondence 
(8.3) E la) <a] = Le f |a>daca| = 2 


where 6(q) is the well-known delta function (see Appendix 6) and J the 
identity operation. 

In particular, one can expand the eigenvector |p> of another con- 
tinuous attribute P in terms of |g> by the correspondence 


(8.40) [b> = E |as><aylb,> <> [p> = f |a">4da"<a"|p> 
and similarly 
(8-4) Ce] = E <eleud as] «> del = f <rla’>da’<a. 


The transformation functions <q'|p> are frequently called “wave 
functions” or simply “‘ys functions” and denoted 
55 
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(8.5) (7) = <a'|P>- 


Referring to the representation spanned by the eigenvectors of @ as the 
q representation, one can say y,(q’) is the component of the vector |p» in 
“direction” q’ in the q representation. Knowing all these components 
y,(q7') is thus tantamount to knowing the vector |p>, the components 
1h,(q') represent |p> in the q representation. 

The scalar product of two vectors |p> and |r) can now be evaluated 
in q representation in terms of their ys functions as follows: 


<rlp> = ff <rla’>da’<a'la"> da" <a"lw> 
= ff <rla’> da’ 8(q’ —4")dq"<a"|p> 


(8.6) = | <rla’>da’ca'lp> = f vE@)¥o(a') da 
so that, in particular, the orthogonality of two states reads 
(8.7) <p"|p’> = f HEA) tyla')da’ = 8(p"—p'). 


In q representation, the operator Q can be thought of as an infinite- 
dimensional diagonal matrix, because its matrix elements are definable 
by the correspondence 


(8.8) 
<a,|Ala> = a<a,|a;> = 4:80 <a"|Qlq> = 1'<a'|7> = 7 87-7"). 


The effect of any operator f(Q) on an arbitrary state, |p> say, with 
components y,(q7') in q representation is accordingly given by 


(8.9) f(Q)|p> = | F@\a">da"<a"lp> = | |a"> aa" fa") 4010") 
so that 
(8.10) <r|fQ)|p> = f oECa") Sa") ¥o(a" aa". 


In particular, the operator P = —i(0/0Q) has in q representation the 
matrix elements 


(8.11) <a" |P\q'> = —4(0/0q')5(q' -9") 
so that in terms of //-functions 
(8.12) <r\P\p> = f F(a") L— 4 Op(q")/2q"} dg". 


It is, on first sight, rather suprising that one can pass from a represen- 
tation spanned by the eigenvectors of a continuous operator, |q> say, 
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to a representation spanned by the denwmerably infinite set of eigen- 
vectors of a discrete operator, |Z;> say, where 


(8.18) H\E> = w| ED; i = 1,2,..., ad infinitum 
so that 

(8.14) [Hi = { |a’>dq'<a' |B 

and 

(8.15) Ig'> = & |B) <Hila’> 


which requires the ¢ functions to be subject to the unitarity conditions 
(8.16) 

<BylHi> = f Bala’ da’ <a|B> = J YE bela) da! = 8 
(8.17) 

<a'lq'> = 3 <a" By Bila’> = & (a) $5) = 3(q"—<'). 

i 

The possibility of the transition from the HZ representation to the q 
representation and vice versa is thus dependent on the existence of 4 
functions satisfying (8.16) and (8.17). Such existence problems can 
always be settled if one succeeds in actually constructing such y func- 


tions. This will be done for some special cases in the next section. 
The measurement symbol M(p,r) corresponding to the graph 


=i 


can also be expressed in terms of the ys functions according to (8.4) as 


(8.18) (pr) = |p><r| = ff |a">da” ¥p(a") va") da’ <a'| 
and its matrix elements in g representation are found to be 


(8.19) 
<¢"|M(p,r)|q> = Jf 8(" —9")da" ¥p(q") oF (a) da’ 8(a" —@") 


= 2,0") (9) 


The density matrix for a state |p), in particular has the form 


(8.20) <q'|M(p)|9> = +p(a') $F (a) 


and is seen to have trace unity if the functions are normalized. 
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NOTE 


Readers plagued by mathematical scruples regarding the concept of 
Hilbert space will find solace and comfort in Chapter II of the monu- 
mental work by von Neumann [1]. 


REFERENCE 


{1] J. von Neumann, “Mathematical Foundations of Quantum Mechanics.” 
Princeton Univ. Press, Princeton, New Jersey, 1955. 


SECTION 9 


Displacements of the Observer 


Any actual observation of a physical object is always made in a certain 
coordinate system. To be precise when describing the state of the object 
one should thus, in principle, always include information about the 
position of the observer. Any state |b> must therefore contain, in addition 
toall the variables attributable to the object, such as spins, momentum p, 
etc., the coordinates of the observer, z; say, and can be considered in fact 
a function |s,p,...,2;). 

When describing a single object it will now be assumed tentatively 
that one can meaningfully define an operator Q which represents the 
location of the object such that one can represent the state vector in 
terms of the eigenvectors of Q, denoted |q>, satisfying Q|q> = q|q>. 

Postponing questions of how to actually determine experimentally 
the components of the state in this “coordinate representation,” occa- 
sioned by the continuous range of possible values q of the location Q, 
making |q,...> necessarily an infinite-dimensional vector, the position 
variables q will now, by convention, be chosen to originate at the position 
of the observer, so that |q,...> describes the state of the object as seen 
by an observer (0) located at q = 0. It must be stressed, however, that 
the possibility of ascribing a fixed position to the observer should be 
considered as a tentative hypothesis, pending investigation of whether 
this hypothesis is compatible with the various tasks of measurement 
assigned to the observer. 

Disregarding then, for the time being, these profound questions regard- 
ing actual position measurements, it will now be attempted to describe 
the same object in terms of the state used by an observer (1) who is 
displaced with respect to (0). If, in particular, the origin of (1) is displaced 
from the origin of (0) by a vector a pointing from (1) to (0), as indicated 
in Fig. 9.1, then (1) will describe the object in terms of a state 
(9.1) la” = |a+a). 

From the requirement that the physical attributes of the object be 
invariant under displacements of the observer, one can infer the 
existence of a unitary operator 7’, of displacement, defined by 


(9.2) lat+a> = 7,|q>. 
59 
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eps allen 


(1) 
(0) 


Fic. 9.1. Description of the position of an object (X) as seen by an observer (1) dis- 
placed by a with respect to an observer (0). 


In this coordinate representation one can apply to each component the 
expansion 


(9.3) la+a> = [1+ (a.0/0Q) + (4!) (a.2/0Q)?+...]]q>- 
Introducing an operator 

(9.4) P = —i(0/0Q) 

one can write siete: 

(9.5) = [I+i(aP) + (é?/2!) (aP)?+...] = e@?). 


The inverse transformation is similarly given by 
(9.6) Pheer MOR), 


The unitarity of 7,, 7;1=T; requires then that P be a hermitean 
operator. 
To elucidate the physical meaning of the operator P consider the 
special case in which the displacement vector a is a linear function of 
time, 


(9.7) a = Vt. 


One envisages the observers (0) and (1) moving relative to each other 
with constant velocity V. The transformation operator 7’, is now a 
function of time, and represents the Galileo transformation. 

Whenever in the equation 


(9.8) |b” = Tb» 
the operator 7' depends on time one must distinguish carefully between 


(i) (0/d#) ( |b>”) which tells how observer (1) finds his state description 
of the object varies with time, and 
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(ii) (0|6>/@t)” which tells how observer (1) finds observer (0)’s state 
description of the same object varies with time. 


Since |b) satisfies, in the state picture, the equation 
(9.9) 4 0|b>/ot = H|b> 
one obtains by application of 7’ from the left 
(9.10) i(8|b>/0t)” = H™|by? with A TET 
On the other hand, differentiating (9.8) with respect to time and using 
(9.9) one finds 
(9.11) 

i(0/At) (\b>”) = iT (A|b>/At) +17 b> = (A? +i2T)|by". 

This means the effective Hamiltonian of observer (1), namely the Hamil- 


tonian which determines the development of observer (1)’s state |b>” in 
time, according to the fundamental dynamical postulate, is given by 


(9.12) Hee = BT iT T+, 
For the special case of the Galileo transformation (9.7) one obtains 
from (9.5) 
(9,13) Hg = H*—(VP). 

Now in classical mechanics the Hamiltonian of a free particle of 
momentum p and mass m is 


(9.14) H = (}m)p? 
and the effective Hamiltonian after a Galileo transformation is 
(9.15) Hog = (4m)p?—(Vp) 


giving rise to the correct addition of velocities, 

This correspondence suggests the identification of the operator P with 
the operator representing the momentum of the object. From this identi- 
fication, it follows immediately that the position and momentum of an 
object are incompatible observables, because for each component P;, Q, 
one finds 


(9.16) PjQ,-Q,P; = —il8,. 


Itis instructive to work out the transformation formula for the position 
operator as a consequence of this commutation relation. By expansion 
one has, for each component, 


(9.17) 
Q = T,QT;' = e? Qe? = Q+ia[PQ] + (i?/2!)a°[PLPQ]]+... 
=Q+a=Q4+Vt 
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as expected, Differentiation of (9.17) with respect to time gives 

(9.18) 

0Q7/at = T,(0Q/0t)T,1+T,QTa'+T.QTa = (8Q/at)" + iV(PQ” —Q?P) 
= (0Q/dt)" +V 

as expected, because (0Q/dt)” tells how the observer (1) would find 

observer (0)’s position of the object vary with time, and Kq. (9.18) is just 

the “addition theorem of velocities” in accordance with the invariance 

under Galileo transformations. 

Considering now, as an example, the Hamiltonian (9.14) of a “free” 
object which possesses momentum as an attribute, the state of the object 
satisfies, in the state picture and in coordinate representation, the equa- 
tion 
(9.19) i(0/At)|q(t)> = (4m) P*|q(t)> = — (4m) (2/0Q)*|q(t)>. 

The state vector |q(t)> is thus obviously not an eigenstate of H. Denoting 
the eigenstate of H with eigenvalue w,; by |H;>, satisfying 


(9.20) H|E,> = w,|ED 
and writing in accordance with (7.17) 
(9.21) |a()> = e-*|q(0)> 


one can, upon expansion of |HZ;> in terms of |q(0)>, introducing the 
time-independent ¢ function bz,(q) = <q(0)|2;> by 


(9.22) [i> = f |a'(0)>da’ dai(a’), 
cast the eigenvalue problem (9.20), upon application of (8.9), in the form 
(9.28) H\B,> = { |a'(0)>dq'(— 4m) (0*p,/24) 


= 0% f |a(0)> dq’ bn,(4)- 


The linear independence of the components |q’(0)> requires then that 
the y function characterizing the state |2;> in q representation satisfy 
the eigenvalue equation 


(9.24) (= dn) 2? hp, (q)/0q? = wPz,(q). 


It is often convenient to expand |q(t)> directly, with the help of a 
time-dependent % function Y 


(9.25) 
Ja(t)> = x |2;> <Ei|q(t)> = x |Ei> ¥3,(q,t) = eH x |Ei> $8,(q) 
= [Bde g5 (4) 
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so that the time-dependent y function and time-independent y function 
are related by 


(9.26) Po,(q.t) = ep, (q)- 

Equation (9.24) has obviously the solution 

(9.27) n,(q) = const eo 

provided k; and w; are connected by the relation 

(9.28) w;, = (ki/2m). 

In this description there is thus associated with an object of energy Z 
and momentum p a characteristic frequency w and a characteristic 
length |k|-* 

(9.29) o=H; ki=1p 

which determine the probability amplitude for finding the object at 
time ¢ at position q, if it is known to have energy # and momentum p 


(energy # and momentum p being compatible, because HP —PH = 0 in 
this case), namely 


(9.30) <q(t)|Z> = Yz(q,t) = const elotko) 
where the constant has to be determined by the normalization of the 
probability. 

Tn the operator picture, the development of the object described by 
the Hamiltonian (9.14) is given by the operator equations, using (9.16), 


aon ie = i(HP,—P,H) = 0 
31 
Qa = i(HQ;—Q; H) = (i/2m) (P? Q; —Q; P*) = P|m 
so that the expectation values of the momentum and coordinates of a 


free object satisfy the classical relationships 


(9.32) P=0; P=m@ 

This correspondence to classical mechanics is ultimately the justification 
for considering the Hamiltonian (9.14) as the “correct”? Hamiltonian 
to be inserted in the fundamental dynamical postulate of quantum 
mechanics for a free object. 


NOTES 


The historical event which marks the advent of quantum mechanics 
was publication of a paper by de Broglie [1], who first noticed that by 
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association of a frequency w and a length |k|~ with energy and momen- 
tum of a particle, one can describe its propagation in a language devised 
originally to describe the propagation of a signal 


+0 
fat) = f explitwt—kg)]f(k)dk; wo = VmF-+ke, 
—w 
made up out of superposition of ys functions. 

Since the phase velocity of the ys function, V(phase) = w/k, is obviously 
larger than the speed of light, as long as m +0, one cannot identify 
propagation of the % function with the motion of some observable 
material, and the probability interpretation of is, indeed, widely 
accepted today. 

However, a temptation presents itself: The functions making up 
the signal f(q,t) are by analogy with interference patterns observed, for 
example, on the surface of liquids suggestive of some underlying medium 

r “ether.” Such is the human urge to hang on to the familiar that a 
number of distinguished physicists, de Broglie among them, have 
plunged into elaborate, often desperate, efforts to reconstruct from the 
observable features of matter the hydrodynamics, as it were, of that 
hypothetical, not directly observable, ether which is envisaged as the 
stage below the so-called elementary particles in the hierarchy of nature. 

From a strictly operational point of view all such attempts have thus 
far proven to be sterile, because they have not led to a single experiment 
which could be used as a crucial test. A comprehensive list of references 
to this approach is contained in the work of Takabayasi [2]. 

The proof that expectation values of momentum and coordinate of 
single objects satisfy the classical relationships was given by Ehrenfest 
[3]. 
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“SECTION 10 


Uncertainties and the Relations between Them 


A concept basic for any theory of measurement is the uncertainty 4A 
in the experimentally realizable knowledge of an observable A if the 
object is in a given state |b). Define 4A as the root of the mean square 
deviation from the expectation value 


(10.1) (4A)? = (A—A)? = A?—(A)*. 
Introducing the notation “norm of |b)” 
(10.2) llb| = V<b[by 


one may write, using the hermitean property of A, 


(10.3) (44)? = [A—<8]4]6>P = <O|[A — <0] A|b>}?[0> 
= <(A—<|4]b>]5|[A — | 4B] 6> 
= |I[A—<b]A]o>] 0). 
The following fundamental theorem holds, 


If the operators P and Q representing two observables satisfy the 
relation 


(10.4) PQO=QP= =i 
then 
(10.5) APAQ > }. 


The inequality (10.5) is called an uncertainty relation. 
To prove it introduce the abbreviations 


(10.6) P=P-P =P-|Plb>; 9 = Q-Q = Q-<|Q|by, 
notice that again 
(10.7) PQ-@P = -i1 
and because of (10.3) 
(10.8) APAQ = |\Po||-||Q\). 
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Now apply to the right-hand side of (10.8) the well-known inequality 
(10.9) ijell- lol] > |<elo>], 


in which the equality sign holds only if |u> and |v) differ at most by a 
constant complex factor, and which is the generalization of the simple 
geometrical fact that the projection of two vectors on each other is never 
larger than the product of their lengths. One obtains 


(10.10) APAQ > |<Pb|Qb>| 
and therefore also 
(10.11) AP AQ > Im<Pb|Qb». 


By employing the identity Im(«+78) = —(i/2)[(«+7#8)—(«—7)], using 
<ul|v>* = <v|w>, and supposing |b) to be normalized, one finds 


(10.12) 

APAQ > —(i/2)(<Pb|Qb> — <Qb| Pb») = — (é/2) (<QPb|b> —<PQb|b>) 
= 4<i(PQ—QP) bby 
= HI? = 4 


which completes the proof. It follows further from this derivation that 
the product of the two uncertainties will be a minimum if the state |b> 
satisfies the condition 


(10.13) P\b> = iyQ|b;-y_-realand > 0. 


Such states are called optimum states. 

One finds, sometimes, statements to the effect that all pairs of 
canonical variables in classical mechanics are represented in quantum 
mechanics by operators satisfying (10.4) and are therefore subject to 
uncertainty relations (10.5). Such statements must be approached with 
caution, because the uncertainty relation (10.5) refers to simultaneous 
measurement of two observables, the object being in a single state |b). 
They may, therefore, possibly apply to the momentum and position of 
a single object. Now in a certain formal sense one may treat the time 
variable and the energy of an object in classical mechanics as canonical 
variables. It would, however, be quite wrong to infer from this that one 
cannot determine the energy of an object exactly at a given instant of 
time. A detailed study of the nature of energy measurement shows that 
the law of conservation of energy can be verified by two successive 
energy measurements w, and ws at times ¢; and t, only to an accuracy 
|w, —wa|(t2—t;) > 1, but this refers to measurements in which the object 
is in different states, namely |b(t,)> and |b(t,)>, so that the proof given 
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above is not applicable, and this “uncertainty relation for energy”’ has 
a different origin. For details the reader is referred to a particularly 
thorough discussion in the work by Messiah quoted at the end of this 
section. 

Now, identifying P and Q again with the momentum and position 
operator of a single object, Eq. (10.13) governing the optimum states 
reads, in q representation, 


(10.14) [—1(8/8Q)—P]|b> = iy(Q—Q) |b> 


so that the corresponding ¢ function (7) = <q|b> satisfies the first order 
differential equation 


(10.15) [-é(d/dq)—P]n(q) = iy(g—@) (a) 
yielding by integration 
(10.16) 


q@ 
dela) = exp f (—y9+yQ+iP) dq = Ceoxp[—(y/2)q*+70q+ iPa] 


or 

(10.17) Yo(q) = C' exp[—(y/2) (q—Q)* + iPaq] 

with some constant C’, to be determined by the normalization 
+0 

(10.18) <b|d> =f |ds(a)|®dg = 1 


—w 
which gives 
Py 


F 
(10.19) |C"? f exp[-y(q—@)"Idq = |C'|? Vly) = 1 


so that finally 

(10.20) oa) = (y/7)"* exp [-(y/2) (q-Q)? +*P al. 

The optimum state vector describes thus a Gaussian probability dis- 
tribution around the expectation value Q as center. It is now easy to 
compute P, AP, JQ for the optimum state (10.20). 

An instructive exercise consists of transforming |b) in the q representa- 
tion into |b> in the p representation, Quite generally these representations 
are defined by the eigenvalue problems 
(10.21) P\p> =plp>; lg = ala, 
respectively, and there should exist a unitary operator U so that 


(10.22) |p> =Ulg>; = |a> = U* |p, 
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in components 
(10.23) |p’> = f <p'|Ula">da"la">; la") = | <a"lU*|P'> dp'|p’> 


which transforms any operator A,,) in y representation into the corres- 
ponding operator A,,) in p representation, according to 


(10,24) A(p) = UAq Ut; Aw = UtAiy)U, 


in components 
(10.25) <p"|A|p’> = [f <p"|Ula">dq"<q"|Alq’> da'<q'|U*| ">. 


From the definitions (10.23), which may alternatively be written as 
expansions into ys functions 


(10.26) |p’> = f |a">da"vy(a")s — |a"> = J |">dp’ v5.0"), 
it follows that the matrix elements of U are identical with the ¢ functions 
(10.27) 

<p'|U|q"> = <a" |p'> = ea"); <a" |U* |p") = <P'la> = Ye"). 


By applying the representation P = —i(0/0Q) to the first equation 
(10.21), one finds that %(q") satisfies the differential equation 


(10.28) — t[0p%(9")/0q")] = p' op(Q") 
having the solution 
(10.29) PR(q") = conste'” * = <q"|U*|p'd. 


The constant of integration can be determined from the unitarity of U, 
(10.30) <p"|UU*|p'> = <p"|U|q'> dq'<q'|U*| "> 

jeonst|* { exp [i(p"—p')q']dq’ 
2n|const|*5(p’—p") = 8(p’—p") 


so that finally 
(10.31) <p'|U|a"> = (1/V ime, 


The components of |p> and |g> are thus connected by the Fourier 
transformation 


(10.32) 
[p> = (Uv 2x) fee |g"y dg’; |a"> = (1/V 2x) f ee” |p'y dp. 
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In particular, one obtains the ys function of |b> in p representation from 
(10.20) by the integration 


cy 


+ 
(10.33) Yoo) = (1/-V2ar) (y/) 4 J exp [—(y/2) (q—@)? +i(P —p) q]d9. 


o 


NOTES 


The classic source for discussion of the uncertainty relations is the 
work by Heisenberg [1]. 

Messiah [2] should be consulted, with particular reference to Chapters 
IV and VIII, on the time-energy uncertainty relation. 

Landau and Peierls [3] have given a provocative discussion of some 
implications of the time-energy uncertainty relation, and have pointed 
out, in particular, its consequences for the measurement of particle 
momentum in the relativistic case, the measurement of electromagnetic 
fields, and the measurement of position of particles that do not or do 
have mass. 
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“SECTION 11 


A Digression on Superfluidity 


The general transformation formula (9.13) is, of course, applicable to 
any object possessing energy and momentum and is not confined to the 
special case of a classical “particle” whose energy, as a function of its 
momentum k, is of the form w = k?/2m. In particular, the object may 
be a collective excitation or “quasi particle’? which is considered the 
carrier of excitation energy and momentum in a fluid. The energy of 
such a quasi particle, &(/) as a function of its linear momentum may, for 
example, have the form indicated in Fig. 11.1, If this is the case, then the 


wo 


k 


Fig. 11.1. Quasi-particle energy spectrum as proposed by Landau, 


fluid in its ground state, when traveling through a capillary, cannot lose 
energy to the walls of the capillary and thus experience friction, unless 
the velocity exceeds a certain critical value V,= tana, where tana is 
the slope of the straight line drawn from the origin which just touches 
the curve &(k). Historically, it was the existence of superfluid helium IT 
exhibiting just such a critical velocity which led Landau to conjecture 
the existence of quasi particles in helium II with an energy spectrum of 
the type indicated in Fig. 11.1. 

To understand the origin of this superfluid behavior in the presence 
of such a quasi particle energy spectrum, consider a fluid moving with 

vat 
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velocity Vo through a capillary, and label all quantities in this coordinate 
system S with no suffixes on them (Fig. 11.2). Alternatively, the same 


mM 
——$—$$—$—$$— $$ ___ 
S———— Vo 
ee 
~Vo 
System S: Wall stationary, System So: Wall moving with 
fluid moving with velocity Vo. velocity — Vo, fluid stationary. 


Fia. 11.2. Coordinate transformation to observer moving with the fluid in a capillary, 


experimental situation can be described by an observer whose coordinate 
system Sy is tied to the fluid, so that in this case the walls move with 
velocity —Vo. In frame Sp all quantities will be labeled with a subscript 
“zero” as suffix. 

Now suppose the fluid is at a temperature of absolute zero, i.e. it is in 
its ground state and there are no quasi particles present.* The initial 
kinetic energy of the fluid is 


inS: in So: 
(11.1) (initial) = 4MV@ E,(initial) = 0 


where M is the mass of the entire fluid. When viscosity is present the 
fluid should start to move in Sy and thus start to lose energy to the walls 
in S. In the excitation theory, fluid motion in Sp can appear only if 
internal motions of the fluid are gradually excited, i.e. if quasi particles 
begin to appear, owing to interaction with the wall. 

Suppose then that one quasi particle of momentum ky and energy 
& (ko) has somehow appeared by interaction with the wall in system So 
so that the energy of the fluid in So is now 


(11.2) By = Go(k)) (in Sp): 


To find the energy of that single quasi particle in system S one can now 
use the general transformation formula (9.13). Thus, if S moves with 


* Tho absence of quasi particles in the ground state is a hypothesis which is strictly 
tenable in absence of interaction between the molecules of the fluid only. See the more 
detailed examination of this point in Section 30. The argument in the present section can, 
however, be modified to accommodate possible presence of a pool of quasi particles which 
do not disappear even at a temperature of absolute zero. 
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respect to So with velocity Vo, then the effective Hamiltonian H,_ of an 
object of momentum P» is, in system S, given by 


(11.3) Hog = H6+(PoVo) 


where Hf = UH, U-", with U = exp (iPoVot). For “free” quasi particles, 
Hy is coordinate independent and thus Hf =H and the change in 
effective energy is entirely due to the kinetic effect of relative motion. 
One finds thus for the energy of the quasi particle in S 


(11.4) @(k-o) = Bo(ko) + (Ko Vo) 
and therefore the total energy of the fluid in S becomes now 
(11.5) E = &o(ko)+(KoVo)+4MV5 = (in 8). 


Now in order that viscosity be observed, the change in energy in S must 
be negative, which according to (11.1) and (11.5) is 


(11.6) B—Hi(initial) = &o(ko) + (Ky Vo) < 0. 


For a given value of ko, this quantity has its minimum when the vectors 
ky and Vo are antiparallel, corresponding to a quasi particle having 


Fig. 11.3. Quasi particle of momentum kp antiparallel to Vo, described in So. 


momentum parallel to the motion of the wall in Sq as indicated in Fig, 
11.3. One must, therefore, have certainly 


(1.17) Go(ko)—ky Vo < 0 or Vo > Go(ko)/ko. 


This inequality, which is necessary for the existence of viscosity, must 
be satisfied for at least some value kg of a possible quasi particle. Now, 
G@o(ko)/ko is the slope of the straight line connecting origin and the point 
G@o(ko) on the excitation curve. Its minimum value corresponds just to 
tan« as indicated in Fig. 11.1. If tan«40, then, for velocities 
Vo< V,=tan«, quasi particles cannot appear in the fluid and the fluid 
will be superfluid and not slow down by friction. The condition tana 4 0 
is obviously equivalent to the requirement that &(k) does not have 
vanishing slope at the origin. Therefore, in particular, whenever the 
lowest quasiparticles are phonons, for which &(&) = ¢yk (cy is the velocity 
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of sound), superfluidity will occur. To prove that a fluid becomes super- 
fluid at low temperatures one has thus to prove that only phonons can be 
excited at sufficiently low temperatures. This will be done for some 
special cases in Section 30. 

As was first shown by Landau, phonons account for part of the inertia 
of a fluid in which they are present. If phonons were the only excitations 
available, then one can, in principle, obtain the transition temperature 
separating superfluid from normal fluid behavior by computing the 
temperature at which the inertia of the phonons is equal to that of the 
entire fluid. In liquid helium I the contribution from the “roton” part 
of the excitation spectrum, corresponding to excitations lying near the 


Fic. 11.4. Possible actual excitation spectrum in liquid helium II. 


minimum of the curve in Fig. 11.1, dominates at temperatures above 
0.6°K, and the transition to normal behavior can be accounted for by 
thermal excitation of rotons with an energy & = 4 ~ 9°K. This value of 4 
leads unfortunately to a value of «, if computed after the fashion of 
Fig. 11.1, which gives critical velocities far above the observed critical 
velocities. 

This difficulty may possibly be resolved, if the mechanism of onset of 
viscosity at the critical velocity is due to an actual excitation spectrum 
of a form drawn in Fig. 11.4, corresponding to possible existence of a 
number of quasistable excitations of the roton type, but of higher effec- 
tive mass (i.e. narrower excitation line) and higher excitation energy 
A, > A, so that although they may not be excited thermally in any num- 
bers at superfluid temperatures, they may be excited mechanically, 
owing to the lower value of « as compared to the value of « in Fig. 11.1. 
Such quasi particles might be identical with the “long” rotons or vortex 
lines of a type first conjectured by Onsager and Feynman in connection 
with the problem posed by the critical velocity in helium II. Although 
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existence of what is labeled in Fig. 11,4 “thermal rotons” can be proven, 
and will be proven in Section 30, from first principles for special systems, 
a strict quantum mechanical xeistence proof for “long” rotons or vortex 
lines belongs to the large class of as yet unsolved problems. 

The excitation of long vortex lines by friction can be looked wpon as a 
mechanical ‘‘pumping”’ giving rise to occupation of energy levels 4, 
above the levels 4 which remain practically empty at temperatures below 
0.6°K, and, thus, to a situation analogous to the states of negative tem- 
perature familiar from similar situations used in the construction of 
masers. This raises the interesting question of whether one cannot induce 
transitions 4,—> 4 mechanically by feeding into liquid helium II, moving 
through a capillary at velocities above the critical velocity, oscillations 
of a frequency in resonance with the energy level difference 4,— 4, 
leading to the possible amplification of such oscillations by stimulated 
emission. This amplification should be absent as long as the liquid moves 
through the capillary at velocities below the critical velocity. In this 
connection, it might be worth noting that the excitation spectrum 
envisaged in Fig. 11.4 may be much too simple, and there may exist not 
just one but many types of vortex lines with levels 4,, all higher than J, 
and transitions may be inducible corresponding to transformation of one 
type of vortex line into another similar type, involving an energy change 
much smaller than the one corresponding to the entire breakup of a 
vortex line into thermal rotons. 


NOTES 


Landau [1] first proposed the existence of a quasi-particle spectrum, 
as in Fig. 11.1, to explain the superfluidity of liquid helium IT. 

Onsager [2] pointed out the possible existence of quasi-stable excita- 
tions other than thermal rotons envisaged by Landau, and Feynman [3] 
used the concept of quantized vortex lines of the type suggested by 
Onsager to give a quantitative account of the critical velocity in liquid 
helium IT. 

For a review of experimental work on vortex lines in liquid helium II 
see Vinen [4]. 

REFERENCES 


[1] L. Landau, J. Phys. USSR 5, 71 (1941). 

[2] L. Onsager, Nuovo Cimento 6, Suppl. 2, 249 (1949). 

[3] R. P. Feynman, Progr. Low Temp. Phys. 1, Chapter II (1955). 
[4] W. F. Vinen, Progr. Low Temp. Phys. 3, Chapter I (1961). 


a 


oe SO ANR 


_—— a SECTION 12 


Rotations of the Observer 


In analogy to the development of Section 9, consider next the rotation 
of an observer (1) with respect to an observer (0) by angle ¢ around a 
given axis n. It is claimed that the respective descriptions of an object 
in terms of states |b>” and |b) are connected by the unitary transfor- 
mation 
(12.1) [7 =Tr gl; Tag = ero 
where J is a hermitean operator J;, J2, Js representing the angular 
momentum of the object. 

To verify this consider first a single object without spin, whose angular 
momentum is purely orbital and shall be represented, in correspondence 
with the expressions known from classical mechanics, by the three 
operators 


Ji = Q2P3—QsP2 11Q2(0/2Qs) —Qs(0/2Q:)] 
(12.2) To = QsP1—QiPs = —i[Q3(0/0Q1) — 21(0/2Qs)] 
Js = Q:P2-QeP1 = —%[Q1(0/0Q2) — Q2(3/2Q,)] 
with the understanding that the eigenstates of Q, namely |q>, form a 


complete set and may be used as a basis for the description of the object. 
One verifies that these operators satisfy the C.R.s 


(12.3) 


J1J2—JoJ, = tJ (cyclically) or (J xJ] = J 
which are formally identical with the C.R.s (2.47) found for the spin 
angular momentum operators s;. 
Now let the axis of rotation be the q3-axis, so that the coordinates of 
the object as seen by the two observers are connected by the transfor- 
mation 


(12.4) 

i = Qicosp+Qssind; Qi = Qfcosd—Qzsind; 0Q,/6 = —Qy 
QE = —Qisind+Q2cosd; Q2 = Qisind+Qfcosd; 0Q2/8¢ = Q, 
Q = Qs; Qs = 8; 2Q,/0¢ = 0. 
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The expansion of the transformed state, 
(12.5) 

lad” = |a%,q3,0%> = |a> +$(d|a>/dd)4—0 + ($*/2!) (d?|a>/dh*)g-0+--- 
may now be written, using 


(12.6) 
(d\q>/dp)p-0 = & (2|97/20;) (0Q;/@4) 
= [Q1(8/8Qs) — Q2(8/0Q;)]|a> = iJ3\q> 
in the form 
(12.7) la” = & ("int Gayla = a> 


which verifies (12.1) for this special case. 
If the object is a spin pointing in direction #, on the unit sphere so 
that 


(12.8) [b> = (i (8/2) ) 


sin (9/2) e'? 
one expects the transformation operator for rotation around the (3-aXis 
to be given in terms of 83 = 03/2 by 


(12.9) APN Stahl 


The simple form of o, allows summation of the infinite series repre- 
sented by 7'y. Using 


1 0 7 0 
os = (4 a) and a= (, i) = 


one finds 


(12.10) 
Ts 


[1 + (1/2!) (if /2)2-+. . JL + [Cig /2) + (1/31) (if /2)° +. Jos 


ile 
cos (f/2) + isin ($/2) 03 = is fe mn) 


so that 


; (/2. 
(2.11) [oP = eo yet?) 


which differs, apart from a physically unobservable common phase 
factor, from |b) by the change in azimuth angle ¢” = ¢—¢. This verifies 
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again the suitability of the unitary transformation ( 12.1) for this special 
case, 

Of particular interest is rotation with uniform angular velocity w so 
that 


(12.12) nd = wt 
and the transformation operator becomes a function of time 
(12.18) Tey = ello, 


The effective Hamiltonian for the rotating observer will then be given, in 
analogy to the form encountered in case of uniform linear motion, by 


(12,14) Huq = H?+iTT = HT-(w-J). 
It is understood that w is the vector of angular velocity of observer (1) 
as described by observer (0). The angular velocity of observer (0) with 
respect to observer (1) is then obviously —w. 

In the following, the transformation formulae for the components of 
angular momentum are needed. They are, supposing the rotation is 


characterized by angular velocity w = —wk, 
TZ = cst J, eloat — J cos (wt) +S. sin (wt) 
(12.15) JE = etwht J, clot — —J sin (wt) +72. cos (wt) 


JF = ct J, cist — J, 
and follow by straightforward computation from the C.R.s (12.3). For 
example, with the usual abbreviation [J,,J,,] = Ti ITm—TmIns 
(12.16) 
ertodst J, cieolet — J, + (iat) [Fi Jy] + (iot)?/21[[J, Js] Sg] +... 
= Jy[1 —(wt)?/21-+.. .)] +S ofeot — (wt)®/3! +...) 
= J, c08 (wt) + Jy sin (wt). 

For any three operators J,, Jo, Js satisfying the C.R.s (12.3) one can 
prove the following theorem: 

There exist simultaneous eigenstates of one of the three operators, J, 
say, and of J® = J}+J}+J3, which are characterized by two quantum 
numbers j and m so that 
(12.17) 

J*\j,m> = j(j+ I j,m> and J'3|j,m> = mlj,m> 
where j has the possible values j = 0, 4, 1, 8, ..., and where for given j 


the quantum number m may assume the (2j+1) possible values —j, 
—j+1, ...,j—1, j. The proof of this theorem is given in Appendix 1. 
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As an application of the concepts developed for rotating observers, 
consider again the case of a spin magnetic moment in an external field 


(12.18) 
B = Bok+B,[icos (wt)—jsin(wt)]; w= —wk 
giving rise to spin magnetic resonance as explained in Section 7. The 


dynamical equation governing the magnetic moment p= ys in the 
laboratory system reads in the state picture 


(12.19) i[b> = —y(sB)|d>. 

As a first step towards solving (12.19), which has a time dependent 
Hamiltonian H = —y(sB), transform to a system (1) rotating around the 
laboratory system with angular velocity w. The transformed state 
(12.20) |b’ = eKontlp» 

satisfies then the equation 

(12.21) i0|b>'/Ot = Hy_|b>’ 

with 


(12.22) Hog = H'—(ws) = —ye)4(sB) eH! — (ws). 
This can be evaluated using formulae (12.15) as follows, 
(12.28) 
el(s)t(sB) est — e-wt(s, By +.B, (31 008 wt — sy sin wt) |e" 
= 83Bo+8, By 
so that the effective Hamiltonian may be written 
(12.24) on = —7(8Bon) 
where the effective field is now 
(12.25) Boy = [Bo—(w/y)]e+ Bri. 


Equation (12.21) contains thus a Hamiltonian which no longer depends 
on the time ft. The solution of this equation can be expedited by perform- 
ing a second transformation to a system (2) rotating around the direction 
of Big with angular velocity w 


(12.26) w= —yBoy 
having according to (12.25) the numerical value 


(12.27) w= Vai+(wo—w)? 
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where the characteristic frequencies w, and w, are defined, as in Section 7, 
by 


(12.28) wo = yBo; @, = yB,. 

With this choice, the twice transformed state 

(12.29) |b>” = eXwatlpy 

satisfies the equation 

(12.30) i0|b>"/ot = 0 

because the effective Hamiltonian governing the spin in system (2) is 
(12.81) Hg = Hig—(ws) = —7(sBiq)+y(sBeq) = 0 


the effective field Big being the same in both systems (1) and (2), so that 
Hy is invariant under the transformation (12.29), The solution of 
equation (12.19) is thus obtained immediately from the observation that 


(12.32) |b(é)>” = |(0)>” = |b(0)>’ = |8(0)> 
and 

(12.33) [B(t)> = e Kost e-iewart a(t” 

so that 

(12.34) |B(t)> = e Kost e-iwsyt] (0), 


For the explicit evaluation of the two transformations it is useful to 
introduce the angle @ between the effective field Bi and Bo, 


(12.35) cos @ = [By—(w/y)]/Bog = (wo—w)/w; sin@ = w/w. 
One has then 
(12.36) (ws) = —ws, and (ws) = —w(s,sin @ +8, cos @). 


To establish contact with observation consider a spin } and suppose 
at time t= 0 the spin is parallel to Bo, i.e. |b(0)> = |b,> = (3). The 
probability for spin flip is then [see (7.66)] with |b> = Q) 


(12.37) Pay = |<b-[B(t)>|® = [<b_[eHont e-torne, 9/2, 


To evaluate this expression observe that both |b,> and |b_> are eigen- 
states of —i(ws)t =(w/2)o,t with eigenvalues +iwt/2 and —iwt/2, 
respectively. One may thus write 
(12.38) 

<b_[eHont e-hwesye gs = Kent _|etwat yy = em int /2 <b_|e)tB, 
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and therefore 
(12.39) Pay = |<b_-lexp [i(2v/2) t(o, sin O + 3 cos @)]b,>|*. 


Expansion of the exponential operator, using o?* =I and of"! =a, 
gives 


(12.40) 
exp [i(w/2) t(o, sin ® +03 cos @)] = I cos (wt/2) +1(o, sin @ +3 cos @) 
x sin (wt/2). 


P4;) contains thus only matrix elements of the form 
0\/0 1\/1 
124i) Cforbds>=(2)() g(o) = 1s loaded = 93 
<b_|Ib,> = 0, 


so that finally 
(12.42) 


Pop = sin®@ sin® (wt/2) = 


2 2 2 
wy 9 (Vai + (wo) 
w+ (o-at os 


which is identical with the resonance formula (7.67) obtained earlier by 
a completely different method using the interaction picture. 


NOTE 


Rabi et al. [1] promulgated the use of rotating coordinates in the 
treatment of magnetic resonance problems. 


REFERENCE 
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Kh SECTION 13 


The Connection between Invariance Properties of the 
Hamiltonian and Conservation Laws 


Whenever the state describing a physical object is subjected to the 
unitary transformation 
(13.1) Pe OS PMs TP eave ty 


where S is a hermitean operator, and + a parameter characterizing the 
transformation, then the Hamiltonian of the system is transformed 
according to 


(13.2) Hae ie, 
The actual calculation of H” involves in general an expansion 


(13.3) 
H? = (I[+irS+...)H([-irS+...) = H+iz[SH] + (ir)?/21[S[SH]] +... 


One observes that if the Hamiltonian H commutes with the generator S 
of the unitary transformation 7’, then the Hamiltonian is invariant under 
the transformation. 

The converse cannot be inferred, i.e. one cannot conclude H” = H 
means [SH] = 0, except in case of an infinitesimal unitary transformation 


(13.4) T = I+ir8; T = I-irS 
where 7 is now an infinitesimal parameter. The condition 
(13.5) H? = H+i7[SH] = H 

is then indeed equivalent to 

(13.6) [SH] = 0. 


Now, it had been established earlier that the rate of change of the 
expectation value of any observable is determined by the commutator 
of the corresponding operator with the Hamiltonian. If the commutator 
vanishes, the expectation value of the observable will be constant in 
time. 
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One concludes: The operators representing constants of the motion of 
any physical system generate those infinitesimal unitary transformations 
which leave the Hamiltonian invariant. This corresponds to the well- 
known fact of classical mechanics in which the constants of the motion 
generate those infinitesimal canonical transformations which leave the 
Hamiltonian invariant. 

In particular, the results of Sections 9 and 12 may be applied to yield 
the conclusion that the law of conservation of momentum is a con- 
sequence of the invariance of the Hamiltonian under infinitesimal 
translations, and the law of conservation of angular momentum is a 
consequence of the invariance of the Hamiltonian under infinitesimal 
rotations. 

The Hamiltonian itself may be looked upon as the generator of an 
infinitesimal unitary transformation representing displacement in time. 
Equation (7.5) may be written for an infinitesimal time interval 


(13.7) ieenes = -iH|b(t)), 
which is identical with 

(13.8) |b? = |b¢+7)>, = (L+i78) |b(t)>s 
provided 

(13.9) S=-H 


[see also the statements following Eq. (7.16)]. 

The law of conservation of energy is thus a consequence of the in- 
variance of the Hamiltonian under infinitesimal displacements in time, 
if H is used to represent the energy of the system. This invariance will 
hold whenever H does not contain the time ¢ explicitly. 

These connections between conservation laws and invariance proper- 
ties of the Hamiltonian are of considerable help in selecting Hamiltonians 
suitable for the description of physical systems which are subjected to 
conservation laws. The invariance requirements act as a severe restric- 
tion on possible choices of a Hamiltonian. 

As an example, consider an object having momentum P, spin 8, and 
orbital angular momentum L, and try to find the most general Hamil- 
tonian having, apart from the familiar kinetic energy P*/2m, terms not 
higher than linear in each of P, S, and L. The requirement of invariance 
under rotations restricts the choice to the expressions 


(13,10); (18.11); (13.12) (P-L);  (B-8);_— (SL) 
(18.13); (18.14); (13.15) (P-[SxL)); (S:[P x L]); (L-[P x 8}) 
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because these terms are scalars which commute with the generator L+S 
of the rotations, as can be easily verified. If one adds the further require- 
ment that the Hamiltonian should be invariant under reversal of all 
motions, then the only remaining terms are (P-L), (P-S), (S:L), because 
reversal of motion changes the sign of each P, 8, L, so that the terms 
(13.13)-(13.15) also change sign under this transformation and thus do 
not satisfy the invariance requirement. By adding the final requirement 
that the Hamiltonian be invariant under inversion of the coordinate 
system one finds that only the term (S8-L) satisfies this invariance, 
because, under inversion of coordinates, P changes sign, but 8 and L do 
not, so that (13.10) and (13.11) change sign, whereas (13.12) is the only 
true scalar in this case. The Hamiltonian 


(13.16) H = B(8-L); 8 anumber, 


represents the energy of the so-called spin-orbit coupling which plays an 
important part in the establishment of energy levels in atoms and nuclei. 

The transformations of reversal of motion and inversion of coordinates, 
which have just been introduced in a rather casual fashion, warrant a 
more extensive treatment, to be given in Sections 14 and 15. They differ 
from the displacements and rotations in that they cannot be thought of 
as evolving continuously from the identity transformation, and therefore 
the corresponding infinitesimal transformations do not exist. However, 
invariance of the Hamiltonian under the unitary inversion operation 
leads in quantum mechanics to a peculiar conservation law which has no 
classical analog. The consequences of invariance under reversal of motion 
turn out to be somewhat more subtle, because reversal of motion must 
be represented by an antiunitary operator, and requires a special 
treatment beyond the one given in the present section. It will be found 
that existence of anti-unitary symmetry operators may lead to the 
existence of so-called superselection rules which in effect again guarantee 
the conservation of certain quantum numbers. 

Tt has been known for a long time that most processes which lend 
themselves to description in terms of classical physics are invariant under 
inversion of coordinates and/or under reversal of motion. The conse- 
quences of such invariances are often inconspicuous in classical physics 
and therefore had received little attention until some processes were 
discovered which, at first sight, appeared to violate inversion symmetry 
and the corresponding quantum mechanical conservation law, leading 
to a general re-examination of the symmetry properties of both quantum 
mechanical and classical descriptions of physical objects and processes. 
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NOTES 


Born et al. [1] first recognized the vanishing commutator between an 
observable and the Hamiltonian as the condition for the conservation 
of that observable. 

Wey] [2] stressed the connection with symmetry properties of the 
Hamiltonian. 

Goldstein [3, especially Chapter 8] can be consulted for a review of the 
connection between invariance of the Hamiltonian under canonical 
transformations and conservation laws in classical mechanics. 
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SOTO 14, 


The Invariance under Inversion of Coordinates and the 
Law of Conservation of Parity 


Any physical process or property involving an actual object is said to 
have parity if the mirror image of this process or property is again a 
possible physical process or property involving the same actual object. 

As an example consider a heavy top, spinning around an axis a, 
subject to a torque around an axis b perpendicular to a, which precesses 
around an axis e perpendicular to both a and b. 

The experiment may be done using a bicycle wheel mounted on one 
end of a short axle, with a flexible chain supporting the other end of the 
axle, as indicated in Fig. 14.1, The torque is realized by the weight W 


c 


Fia. 14.1. The precession of a heavy top and the mirror image of this process, 


of the wheel. Actual performance of the experiment shows that a, b, e, 
in that order, form a right-hand system if the usual conventions about 
right and left are adopted. 

If this experiment is viewed through a mirror parallel to the plane 
formed by a and e, one sees the spin of the wheel reversed, a’ = —a, the 
direction of precession reversed, e’ = —c, but the direction of the weight 
W and thus the direction of the torque is unchanged, b’ = b. 
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The process has parity, because the process seen in the mirror can be 
performed in actual fact by reversing the spin of the actual wheel, 
causing the actual precession produced by the same torque to also 
reverse its direction. The vectors a’, b’, e’, in this order, still form a 
right-hand system. 

As a second example consider the magnetic field produced by an 
electric current in a straight conductor. If a magnetic needle is placed 
above the current carrying conductor, it is found to be deflected so that 
the N pole of the magnet points in a right-hand sense around the current 
I, if the usual conventions about right and left are adopted. 


Fiq. 14.2. Deflection of a magnetic dipole by a current and mirror image of this process 
(assuming charge to be a scalar). 


On first sight one might think this effect does not have parity, because 
seen through a mirror parallel to the plane formed by conductor and 
undeflected needle the magnet is deflected in the opposite direction 
while the current direction remains unchanged. If one keeps in mind, 
however, that any magnet can be replaced by a suitable ring current, 
it is seen, as indicated in Fig. 14.2, that the mirror image of a magnet is 
a magnet with the signs of its poles reversed, provided the mirror image 
of a charge is again a charge of the same sign. 

One can conclude the effect has parity, because the deflection seen in 
the mirror can be obtained with an actual current and an actual magnet. 

It is important to note that purely electromagnetic phenomena have 
parity even if electric charge is pseudoscalar, i.e. if the mirror image of a 
positive charge is a negative charge and vice versa. As aresult, the straight 
current I’ would be reversed, as indicated in Fig. 14.3, but now the 
mirror image of the magnet would be a magnet with the signs of the 
poles unchanged. The mirror image of the deflection would thus again 
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correspond to an effect obtainable with actual current and an actual 
magnet. 

Experiments involving electromagnetic effects only cannot be used 
to decide whether an electric charge is scalar or pseudoscalar. The 
mirror image of any purely electromagnetic process can either be realized 
within the actual world using charges of the same sign, or it can be 
realized within an “antiworld” in which the signs of all charges are 
reversed. In this sense the electromagnetic field has a high degree of 
symmetry. 


Fic. 14.3. Deflection of a magnetic dipole by a current and mirror image of this process 
(assuming charge to be a pseudoscalar). 


Processes involving the weak interactions which cause, among other 
things, the 8 decay of nuclei, have less symmetry than purely electro- 
magnetic processes. The mirror image of a 8 decay cannot be realized 
within the actual world containing positively charged nuclei only, but 
it can presumably be realized in an antiworld in which all particles 
involved in the decay are replaced by their antiparticles and in which 
the signs of all electric charges are reversed. Thus 8 decay does not have 
parity, unless one admits experiments in the antiworld among the physically 
possible experiments. 

As a specific example, consider the B decay of a cobalt-60 nucleus 
which, under the emission of an electron and an antineutrino, goes into 
a nickel-60 nucleus. It has been found that the electron emitted goes off 
preferentially in a direction opposite to the magnetically aligned spin 
of the Co®® nucleus. Viewed through a mirror perpendicular to the 
direction of nuclear spin one sees the electron emitted parallel to the 
direction of spin, as indicated in Fig. 14.4, a process which does not 
happen in the actual world. The mirror image of the decay does, however, 
represent correctly the corresponding decay in the antiworld in which 
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Fig. 14.4, The 8 decay of Co® and its mirror image (charge scalar). 


the signs of all charges are reversed, as indicated in Fig. 14.5, and in 
which anti-Co® goes, under emission of an antielectron (or positron) and 
a neutrino, into an anti-Ni™. 


& 


Fia. 14.5. The 8 decay of Co® and its mirror image (charge pseudoscalar). 


Tf one wishes to maintain reflection symmetry as a universal principle, 
one is thus forced to assign an electric charge the transformation charac- 
ter of a pseudoscalar. From now on this will be done. Thus, if one wishes to 
describe the mirror image of objects which do have electric charge as an 
intrinsic property, one must augment the operation of coordinate 
inversion (labeled J7) by an operation of “charge conjugation” (labeled 
I), resulting in what is called in the literature the operation of “ combined 
inversion”’ (labeled X = ITI’). As will be shown in Section 19, one cannot, 
in general, represent J" (and thus 2) by a unitary operator, and the sym- 
metry under combined inversion does not lead to a simple “law of 
conservation of combined parity’. 

It should be stressed once more, however, that no inconsistency results 
if charge is treated as a scalar as long as purely electromagnetic processes 
are considered. Thus, coordinate inversion without particle conjugation 
is a valid symmetry operation as long as weak interactions are excluded 
from consideration. 
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As a first quantum mechanical example consider a single object having 
no internal properties such as spin, charge, ete., and which may therefore 
be described completely by a state vector |b> depending only on the 
coordinates of the object, labeled q, so that there exists a one-component 
y function y,(q) defined by 


(14.1) |b> = f la>da<alo> = f |a>vo(a) da. 


To investigate how one should describe the same object if it is viewed 
through a mirror, it is sufficient to consider only the inversion q > —q 
of the coordinates, because any reflection can be decomposed into an 
inversion and a proper rotation. Inversion of coordinates requires descrip- 
tion of the same object in terms of a transformed state 


(14.2) [b> = J |a>¥o(—4) dq. 


An attempt will now be made to connect |b>” with the untransformed 
state by a linear unitary operator JJ defined as 


(14.3) |b>” = IT|by 


so that JT represents the operation q > —q carried out on the y function 
in (14.1). To ensure that the parity of an object is embodied in the quan- 
tum mechanical description, it suffices to require the Hamiltonian of the 
object to be invariant under the transformation, 


(14.4) H = WHIP, 


because then the transformed state satisfies the same dynamical equation 
as the untransformed state, i.e. in the state picture from 


(14.5) 

H|b> = i|b> follows ITHII-'JI|by = iI|by or H|by™ = i|by™ 
so that if |b> is a possible state, then |b)” is also a possible state of the 
same object. 


Equation (14.4) contains the ‘‘law of conservation of parity,” because 
it may be written 


(14.6) TH —HII = 0 


which means the expectation value of JT is constant in time. This law is 
borne out by all experiments which do not involve weak interactions. 
On the other hand, since the Hamiltonian describing weak interaction 
processes such as the Co decay is apparently not invariant under pure 
coordinate inversion, one may refer to the peculiar correlations of the 
decay products as indicated in Fig. 14.4 as being caused by “non con- 
servation of parity” in that case. 
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Of particular interest are the eigenstates of the operator J7, defined by 
(14.7) T]|wy = Plu); P  anumber. 


The eigenvalues can be determined immediately if |«> is required to be a 
unique function of the coordinates, because in that case application of 
IT twice on |w> must be the identity operation (up to an arbitrary phase 
factor which by convention is chosen to be +1) 


(14.8) IT?|u) = P?|u> = |u». 

Hence there are two eigenvalues of J7, namely the roots of 

(14.9) =P? = 1 ie. P,=+4+1 and Pear) 
Eigenstates of I7 with eigenvalue +1 are called states of even parity 
denoted |u,>, and eigenstates with eigenvalue —1 are called states of 
odd parity denoted |u_>, 

(14.10) Tw. = +|u,> and TI|u) = —|uw. 

It is seen that the parity of a single object, describable by a unique state 


vector, is a dichotomie variable, and, in accordance with the concepts 
developed in Section 1, one may therefore use the representation 


(14.11) T= ( = )3 Ju.> = (hs Ju = (1): 


A state vector |b> need not be an eigenstate of 7, but it can always be 
decomposed 


(14.12) [> = Ja <usfB)-+[u-> <u> = Hol +)(G) +H0(—)(7) 
so that 


(14.13) |b>? = II|by = #o+)(5)—Hol-)(;)- 


It is understood that #,(+) and y,(—) are % functions only with respect 
to the two-dimensional parity space; with respect to all other attributes 
they are still state vectors, with n/2 components if n is the number of 
components of |b. One should, therefore, if one wishes to adhere to quite 
impeccable notation, write the direct product |y,(+)> x (), ete. 

A state with both (+) # 0 and y,(—) # 0 is called a state of mixed 
parity. One can always choose two linear combinations which span 
the same space as the vectors |b> and |b)”, and which have definite 
parity, namely 


vo+)(5) = a1l0>-+ 10>") 
(14.14) ‘ 
vst—)(7) = 4110>—[e>". 
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Eigenstates of the Hamiltonian H belonging to nondegenerate eigen- 
values w are always eigenstates of JT, i.e. have definite parity. Indeed, if 
(14.15) H\u> = w|u> 
then also 
(14.16) IZHIT~IT|u> = wII|u> or HI |\u> = wIl|\uy. 
For nondegenerate w this can be true only if 
(14.17) I|uy = P\uy; | =P anumber, 


which by definition (14.7) shows that |w) is an eigenstate of IT. 

The eigenstates of orbital angular momentum |/,m) of an object also 
have definite parity. In polar coordinates these states read (see Appendix 
1) 


(14.18) |L,m> = f |, 942 Yin(O, @) 
where the y function 
(14.19) — Yi (9, p) © sin™ # (cos 9+ a cos"? 9+...) em, 


Now the inversion q — — q corresponds in polar coordinates to the trans- 
formation r, 3, 9 >r, 7—¥, 7+ @ so that 


(14.20) 
TTY im (0, 2) ¢ sin (7 — 9) [cos (7 — 8) +a. cos—"2(a7 — 9) +... Jim), 
Since for 
; ; Saye [eos e if m even 
leven: cos" (47-3) = eat ifm odd 
(14,21) 


‘ = _ {—cos™ #9 if m even 
lodd: — cos™(7—#) = | 4008-™G GE modd 
and since further 
‘ f +sin™ peime if m even 
™ (oe — im(rt+p) — 
(14.22) sin™ (7B) e' Se if m odd 


one has 
(14.23) ITY im (0, 9) = (—1)' Yim(®; 9)- 


This means a state of odd / has odd parity and a state of even J has even 
parity, regardless of what the value of m may be. 

One can classify generally all observables as even or odd depending on 
whether their operators do not or do change sign under inversion of 
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coordinates. Any even operator A( +) is thus defined as 


(14.24) ITA(+)II-* = A(+) 
and any odd operator B(—) is defined as 
(14,25) JIB(—) II"! = —B(-) 


independent of the transformation properties of A(+) or B(—) under 
rotations or translations of coordinates. It is easily verified that A(+) 
and B(—) may be represented in the representation (14.11) by 


(14,26) A(+) = (5 1) 
(14.27) B(-) = al} 0) 


where now A and B contain the operations on all other variables of the 
object, and the 2x 2 matrices operate on the parity variable alone. In 
any general state (14.12), one finds, thus, for the expectation values of 
A(+) and B(—) the expressions 


(14.28) A(+) = <b|A(+)|b> = <ho( +) A |ho( +)> + <hol—)1A lyo(-)> 


(14.29) B(—) = <b|B(—)|b> = <ifo(+)| Blo —)> + hol —)| Bl yo(+)>- 
From (14.29) one reads immediately the fundamental theorem: 


The expectation value of any odd observable vanishes in any state of 
definite parity, i.e. in a state for which either (+) = 0 or ,(—) = 0. 


Example of an odd observable is the linear momentum of an object, 
examples of even observables are angular momentum and energy of an 
object. 

From the foregoing argument, one concludes that an object in a state 
of definite parity may possess even observables, such as angular momen- 
tum and/or energy, but that it cannot possess odd observables, such as 
linear momentum. In a state of mixed parity, however, an object may 
possess odd observables. 

Prior to the discovery of the correlation between the magnetic moment 
of the Co®° nucleus and the direction of electron emission in its 8 decay 
(see Fig. 14.4), electric charge had conventionally been assumed to 
transform under inversion as a scalar (see Fig. 14.2). With this assump- 
tion, Maxwell’s equations lead one to consider as odd observables, 
magnetic pole, electric dipole, magnetic quadrupole, etc., and to consider 
as even observables, magnetic dipole, electric quadrupole, etc. Accord- 
ingly, the absence of any elementary magnetic poles and of any electric 
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dipole moment in the ground state of any object (usually considered a 
nondegenerate eigenstate of H and thus having definite parity) was taken 
as sufficiently explained by inversion invariance, which does not permit 
existence of odd observables, such as magnetic pole and electric dipole, 
in states of definite parity. 

However, since the advent of the Co experiment, there is now reason 
to assign to electric charge the transformation property of a pseudoscalar, 
in accord with the explanations given in Figs. 14.3 and 14.5, and the 
classification of multipole moments given above is reversed. This raises 
the following questions: 


(i) If indeed electric charge, magnetic dipole, etc., are, respectively, 
pseudoscalar, vector, etc., ie. odd observables, how is it possible that 
many so-called elementary particles and other simple objects possess 
these observables in their ground state? 

(ii) If magnetic pole, electric dipole, etc., are even observables, why 
are they never observed in the ground state of any “elementary ”’ object? 
This question has its root in the generally valid observation that nature 
usually realizes any possibility open to her on general principle, an obser- 
vation which is sometimes stated aphoristically as, ‘““Anything that is not 
forbidden is compulsory.” 


A consistent, though not necessarily correct, set of possible answers to 
these two questions is the following: 


(i) The very fact that there are ground states of objects with non- 
vanishing electric charge means such ground states cannot be states of 
definite parity. This is possible provided the ground state is degenerate 
with respect to the energy of the object. Such a situation may arise quite 
generally if to every ground state of energy w9 describing an object with 
given charge there exists another such state of the same energy wo 
describing the same kind of object, but with opposite charge. If this point 
of view is accepted, then the only “elementary” particles that may 
possess definite parity are the neutral photon, the neutral pion, and the 
neutral kaons. Should any of these particles indeed have definite parity, 
then it cannot possess odd observables such as magnetic dipole moment, 
etc. The conditions under which the concept of the intrinsic parity of 
elementary particles is meaningful will be examined in more detail in 
Section 26. 

(ii) The nonexistence of magnetic dipole, electric dipole, ete., in 
“elementary” systems may be due to some other invariance require- 
ments, such as invariance under reversal of motion, which will be taken 
up in the following Section 15, and/or invariance under particle conjuga- 
tion, to be taken up in Section 28. 
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The division of states into states of even and odd parity, corresponding 
to eigenvalues +1 and —1 of the operator JT, was based on the require- 
ment that the state vector of an object having no intrinsic spatial 
properties, except position, be a unique function of the coordinates. 
This means /7? can differ from the unit operation only by a phase factor 
(a constant of modulus 1) which had been chosen such that JJ* = J. If 
one tries to assign parity to objects having spin j = 4, one encounters an 
additional ambiguity stemming from the double-valuedness of spin 
states as a function of the coordinates. Depending on whether one con- 
siders application of J7 twice as amounting to a rotation of angle n2x 
with even, or amounting to that rotation with n odd, one obtains 


(14.30) 

TT*|u'> = P2\u') = |u') or TT"? |u") = P**lu" = —|u"> 
each case corresponding to a possible representation in which 
(14.31) Pi = +1; Py = —-1 or Py = +8; Pe = —t 
Accordingly one may represent the operator JT by either 


(14.32) wr =(j ei a r= () eo 


if the parity states are represented as usual by |w/.> = |w> = (9) and 
|u“> = |w”> = (2). Since a phase factor is always at one’s disposal in any 
unitary operator, however, no physical restriction is obtained if the 
representation JT’ is used, but it should be kept in mind that this is a 
convention. 

As will be shown in Section 26, the only experimental information 
available about the parity of two interacting objects is whether they have 
the same parity or whether they have opposite parity. It is apparently 
impossible to devise an experiment which would allow determination of 
the absolute parity of an object. One must therefore make the further 
convention of assigning to some object a certain parity and then deter- 
mining the parity of all other objects relative to that reference object. 
The standard convention is to assign even parity to the vacuum state. 


NOTES 


Wigner [1] gave the first clear formulation of the quantum mechanical 
law of conservation of parity as derived from inversion symmetry, 
pointing out that it has no analog in classical mechanics and that it is 
the origin of selection rules discovered in atomic spectra by Laporte [2]. 
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(Incidentally, Wigner did not use the term “parity” in his original 
paper, and Pauli in his famous encyclopedia article calls this quantity 
the ‘‘signature.”’ The present author has been unable to ascertain who 
invented the term “parity” for the eigenvalue of the inversion operator 
IT.) 

Wey] [3] gives an account of the somewhat inconspicuous role played 
by inversion symmetry in classical physics. 

Wu et al. [4] discovered the nonconservation of parity in B-decay of 
Co® by performing an experiment suggested by Lee and Yang [5]. The 
possibility of re-establishing reflection symmetry in case of parity non- 
conservation through admission of combined inversion as a symmetry 
operation had already been suggested by Yang [6] in his report to the 
International Conference in Theoretical Physics, Seattle, 1956. See also 
Landau [7] and footnote 9 in the paper by Wick et al. [8]. 


REFERENCES 


[1] E. Wigner, Uber die Erhaltungssiitze in der Quantenmechanik, Nachr. Akad. 
Wiss. Goettingen Math.-Phys. Kl. p. 375 (1927). 

[2] O. Laporte, Z. Physik 23, 135 (1924), 

[3] H. Weyl, “Symmetry.” Princeton Univ. Press, Princeton, New Jersey, 1952. 

[4] C. S. Wu, E. Ambler, R. W. Hayward, D. D. Hoppes, and P. R. Hudson, 
Phys. Rev. 105, 1413 (1957). 

[5] T. D. Lee and C. N. Yang, Phys. Rev. 104, 254 (1956). 

[6] C. N. Yang, Rev, Mod. Phys. 29, 231 (1957). 

[7] L. D. Landau, Nucl. Phys. 8, 127 (1957). 

[8] G. C. Wick, A. S. Wightman, and E. P. Wigner, Phys. Rev. 88, 101 (1952). 


—— 


e 


lean etd nh “ag” piven Ai 
eiiweap wht aa ctyites aldtqetopery ern turl 
ty capture Co ellen (ned aah lm 

erepge deena 1) hy eccergi> ot a8" ines” Him + 


Decay eh!) erent Maalienanos fel 1 ene 


ry 

4 oe é, hs mupromesetanl ren yt 
7 1G] we? 
SN ee eae : saan oe ted 
naa @ 48 roteerres J i 
wl nt Hoge ad Ai Til) geet 
ialamen 000 alice 

(e) 2 eda ol ey ae 


Ay Via aicoiemanerie 
(rai 8 Sve! ‘As 
BA eee eS eed Tee ET reste Si 
a ee 4 2 wwe A. f 7 
¢ ats faieo noha ae 
aallagtias ee Yoh gies . 
1 Pes) reepeets 
feety 
pees) ir Ae all el ew 


———_—_—_—_-HA-> TT SECTION 15 


Invariance under Reversal of Motion 


Another symmetry property with which all objects and processes 
occurring in nature seem to be endowed is “‘reversality.” Any physical 
process or property involving an actual object is said to have reversality 
if after reversal of all motions one obtains again a possible physical 
process or property involving the same object. 

If reversality were not a universal property of physical reality, there 
should exist stationary situations which are invariant with respect to 
all symmetry operations except reversal of motion. For example, if one 
were to observe in a 8-decay process a state in which the spin of the 
nucleus s, the velocity of the emitted electron v,, and the velocity of the 
emitted y-ray v, are correlated as indicated in Fig. 15.1, so that Wii Nes 


Em 
a 


Fic. 15.1. Example of a state which does not have reversality. 


and s, in this order, form a right-hand system, one would have to infer 
that this state does not have reversality, unless there exists an, as yet 
undiscovered, other form of matter, metamatter (say), distinct from 
antimatter, which is in every respect the motion-reversed image of 
ordinary matter. Indeed, if one reverses in Fig. 15.1 all motions, one 
obtains a state indicated in Fig. 15.2, which is essentially different from 
Fig. 15.1 because the reversed motions v,, ¥,, and §, in this order, form a 
left-hand system contrary to the hypothetical experimental situation of 
Fig. 15.1. It should be noted that the state pictured in Fig. 15.1 has 
parity, because its mirror image, as indicated in Fig. 15.3, is again a state 
in which the reflected motions vy, V,, 8’, in this order, form a right-hand 
system. 
99 
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To date all experiments designed to detect states of the kind indicated 
in Fig. 15.1 have had negative results. In particular, the B decay of 
Co®° as described in Fig. 14.4 has reversality, because the reversed state, 
in which the direction of both nuclear spin and electron velocity are 


Pa 


Fia. 15.2. The state obtained by reversal of motion in the state described by Fig. 15.1. 


reversed, is again a possible state with the electron moving in the direc- 
tion opposite to the direction of nuclear spin. 

The operation of reversal of motion for a nonstationary process can 
be visualized by imagining a moving picture taken of the object under 
consideration, and then having the film reeled off backwards. Since this 
operation is formally equivalent to changing, in all expressions depending 
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Fia. 15.3. The state described by Fig. 15.1 and its mirror image. 


on time, the sign of the time variable, the operation may be referred to as 
“time reversal.” Thus reversal of motion means not only the reversing 
of all motions in a given state, but also the reversing of the sequence of 
states. 

In collision processes the reversality of both initial and final state may 
lead to a very general relation between the process and its “‘inverse”’ 
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process, in which only the sequence of final and initial state is inter- 
changed. This relation is known as the “principle of detailed balance” 
and will be taken up in Section 28. 

One must not confuse reversality of a process with the thermodynamic 
notion of reversibility of a process. A process will have reversality as 
long as the process with all motions reversed is in principle a possible 
process, however improbable it may be. The mechanical mixing of a 
deck of cards has reversality although thermodynamically it would have 
to be classified as an irreversible process. It is an interesting and open 
question whether the breakdown of invariance under reversal of motion, 
and thus of the principle of detailed balance, if ever observed, would 
have any thermodynamical consequences at all, and if so, how far 
reaching these would be. 

Observables can quite generally be classified as time-even or time-odd 
depending on whether they do not or do change sign under time reversal. 
By definition of time reversal all even derivatives of the coordinate Q 
of an object, Q, Q, ete., are time-even, and all odd derivatives, Q, Q, etc., 
are time-odd. If the usual assignment of time-evenness to the mass of 
an object is taken for granted, one can infer from this that linear 
momentum P and angular momentum J are time-odd, whereas the 
energy H is a time-even observable, 

Regarding the transformation properties of electromagnetic observ- 
ables under time reversal, Maxwell’s equations contain only the informa- 
tion that electric charge and magnetic pole must transform oppositely 
under time reversal, so that if one of them is time-even the other one 
must be time-odd. In absence of any experimental information of other 
than electromagnetic origin which could decide whether electric charge 
is time-even or time-odd, charge can be considered as time-odd, in 
accordance with a suggestion by Feynman who has proposed looking 
upon positrons as electrons “running backward in time,” but it should 
be stressed that at this stage this is purely conventional. (In fact, evidence 
will be presented later indicating that electronic charge should be 
considered as time-even, because the lepton number characterizing 
electrons and positrons, as well as neutrinos and antineutrinos, is time- 
even.) Accordingly, electric field E and electric dipole moment p may 
tentatively be considered as time-odd observables, and magnetic field B 
and magnetic dipole moment p as time-even observables. 

Although the definition of time-even and time-odd hermitean opera- 
tors, representing time-even and time-odd observables in quantum 
mechanics, makes sense, the division of states into states of even and odd 
“reversality,”’ in analogy to the division of states into states of even and 
odd parity, is obscure for the following reasons. 
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It is clear that the transformed state under time reversal @, 
(15.1) |b" = Old) 


cannot be connected with |b> by a linear unitary operator, because if 
one demands that time-even energy H, and time-odd linear momentum 
P and angular momentum J, transforms according to 


(15.2) @H@" = H; Ope = -P; 0JO-1 = -4J, 
then the fundamental dynamical equation 


(15.3) H\b> = i0|b>/at 
and the canonical C.R.s 
(15.4) [PQ] = 3; [J x J] = id 


are not invariant under the corresponding unitary transformation, since 
only one side of each equation changes sign. 

One can, however, satisfy the invariance requirements demanded by 
the reversality of physical reality with an antiunitary operator 


(15.5) @=TK 


where 7’ is a unitary operator representing the transformation t > —t, 
and K is the antilinear operator of complex conjugation, applied to all 
numbers in the state vectors, so that in terms of the ¢ function charac- 
terizing the state |b> 


(15.6) 
Alb = @ { |a>dayola,t) = | |>daT¥F(a.t) = f |grdavsa, -0. 


The notion of eigenstates of the reversal operator @, in analogy to the 
eigenstates of the linear inversion operator 7, is thus obviously not a 
sensible concept in general. The fundamental dynamical equation 
satisfied by |b>” follows now from (15.3) by application of (15.5), 


(15.7) @HO-O|by = Gia|b>/at = —iO a|b/at = é(2/At) (Ody) 
which for @H@-! = H can be written 
(15.8) H\b>" = 1 0|by? /at 


and is seen to be identical with Eq. (15.3) satisfied by |b>. The canonical 
C.R.s (15.4) are also invariant under (15.5), because the operator Ks 
changes the sign of the imaginary unit which appears on the right-hand 
side of each Eq. (15.4), so that application of @ in accordance with the 
transformation properties (15.2) of P and J leads to no change in these 
equations. 


15. Invariance under Reversal of Motion 103 


Although a quantum number associated with 0, analogous to parity, 
does not make sense in general, one can characterize a state vector |b) 
by the eigenvalue of the operator @?, which is a linear unitary operator. 
The physical requirement that the operation of time reversal @ carried 
out twice should result in the same state, 


(15.9) @7|by = «b>; el = 1, 


means that all states must be eigenstates of @*. Since the operation of 
complex conjugation carried out twice is equivalent to the identity 
operation, K* = J, one can write 


(15.10) 6? = TRTK = T7*K* — 7T* = ef. 
From the unitarity of 7 follows further that 
(15.11) @=l 


because for a unitary operator one has 7-! = 7'+ =7*, so that the last 
equation (15.10) can be written 


(15.12) T* = T* 
and by transposing this equation once again one obtains 
(15.13) T* = T* = 2 T* 


which can be true only if (15.11) holds. The possible eigenvalues of 0? 
are therefore « = +1 and e = —1. Multiplication of © with a phase e™ 
does not affect 9, of course, because e'* Oe @ = ee @2 — @?, 

It can be shown quite generally that states describing an object having 
integer total angular momentum j and no other internal attributes 
belong to the eigenvalue «= +1 of @, and that states describing an 
object having half-odd integer total angular momentum j and no other 
internal attributes belong to the eigenvalue «= —1 of 6*. (For the 
purpose of the present work it actually suffices to verify this statement 
only for the values j = 0, 4, 1.) To do so, use will be made of a construction 
of the operator @ in the representation in which the component operators 
of angular momentum J,, J; have real matrix elements, and the com- 
ponent operator J, has pure imaginary matrix elements, as explained 
in Appendix 1. 

The transformation equation (15.2) for J, in components, 


(15.14) @J,+J,0 = 0; OJ,+J2O = 0; OJ,+J,0 = 0 
requires the unitary operator 7’ defined by (15.5) to satisfy the relations 
(15.15) TJ,+J,T = 0; TJ,—J2T = 0; TI,+J,T = 0 
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which are solved by a rotation of amount 7 around the 2-axis, as is 
intuitively obvious, 

(15.16) he gee me TO} oe Gs, 


Indeed, by expansion and use of the C.R.s for the J; after the fashion of 
(12.16) one finds 


TIT = &™J,e7™ = J cosa+Jssina = —J; 
(15.17) TI ,T-* = Joe = Jy 

TI;T = &™J,ei™ = Jscosa—J sina = —Js. 
In this representation the operator 6? takes now the form 


(15.18) 
O2 = TT* = cimieins® — gti — [4 YimJy +[(2im)*/2N]JR+... 


Now consider an object in an eigenstate of the total angular momen- 
tum J® characterized by the quantum number j so that J* = j(j+1)I. 
It is always possible to perform a unitary operation which leaves 0? 
invariant, but makes J, a diagonal matrix, so that it can be replaced by 
its eigenvalues which differ from the value j of the total angular momen- 
tum only by integers. Thus, since exp [2zi(integer)] = 1, 9? is identical 
with exp[27ij] which is equal to +1 for integer j and —1 for half-odd 
integer j.* 

It is perhaps instructive to verify this general result for the special 
cases j = 0, j = 4,j = 1. For j = 0 one has trivially 


(15.19) 6% j = 0) =I. 
For j =4 one has J, = }(¢ >‘) and thus J3 = 4°, so that 
(15.20) Oj = 4) = Icos7+2Jysing = —I. 


For j = 1 one has 


1 (9 -i 0 if} 0 -1 
I=yli 0 -i);  E=5{ 0 2 Of; Spare 
2N0 aie 0, Fe ns | 


so that 
(15.21) @®(j = 1) = 1+ iJ gsin (27) —J3[1 — cos (27)] = I 


and so on. 


* The author is indebted to Mr. David Pink and to Mr. Patrick Whelan for this simple 
line of reasoning, 
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The cases « = +1 and ¢ = —1 require now separate treatment. 

(i) «= +1. The object can be described in terms of an orthogonal set 
|b, of eigenstates of a complete observable B, for example in terms of 
the eigenstates |l,m> of J* and Js, which are simultaneously eigenstates 
of the operator 0*, 


(15.22) 6? |b,> = |b4>. 


It follows that |b,>” =@|b,> is also an eigenstate of O* with eigenvalue 
+1. So is 


(15.23) even? = e(|4>+|b,>); real, if O]b,> # —[by>; 
unless it vanishes, and one has thus 


(15.24) O|teven> = |even> 


which explains the label “even”. If it should happen that for aspecific 
state, |b, say, @|b.> = —|b/.>, one chooses 


(15.23') |teven? = [043 if @IB4> = —[04>; 


and Kq. (15.24) is again valid. Similarly, one can construct from the set 
|b,> a set |toaa> by 


(15.25) |aoaa> = ¢(|b4>— O|b,>); creal,if O|b,> # |b,> 


(15.25') |uoaa> = i]6,>; if 64> = [04 
which satisfies 
(15.26) O|uoaa> = —|Moaa>- 


This possibility of finding even and odd states is, contrary to the situation 
under the inversion operation IT, not equivalent to the possibility of labeling 
the states by a physically meaningful quantum number characteristic of 0, 
such as parity in case of IT, because one can transform an even state into 
an odd state under ® and vice versa simply by multiplication of |uy with a 
physically unobservable phase factor i. The quantum number e¢ charac- 
teristic of @*, however, is not affected by such a change in phase and 
should therefore be a physically meaningful label of a state. 

Although the states |w) are not eigenstates of @ in the usual sense, they 
may be referred to as “invariant states” of @, and one has the general 
theorem: Ina state invariant under © the expectation value of any time-odd 
operator vanishes. 

Indeed, if @4@~' = —A, then in an invariant state |w>, so that 
O|u> =e|u> with arbitrary «, 


(15.27) A = <ulAlu) = —<ulOAO uy = —<ule* A e|uy = —A 
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requiring A =0, independent of the phase «. This statement is thus 
invariant under unitary transformations, even though any particular 
state |w>, say |t%eyen>, allows only real orthogonal transformations 
|u> +R|u> with R = R-! if @ = is to remain valid. Under a unitary 
transformation U+ = U1 the operator @ transforms into 
0’ = VOU" = UT(U-)* K = U(U)* 1" K 
with 
Tl =0tTU (or 6 = UUT'R). 

If Feynman’s assignment of time-oddness to electric charge and electric 
dipole moment were correct, one could, for example, conclude from this 
theorem that an elementary particle which has integer spin, and no 
other intrinsic attributes which might invalidate the labeling with the 
quantum number = +1, must be electrically neutral and cannot possess 
any electric dipole moment either. 

If it should happen for a specific state |b‘,> that @|b{.> = |b{>, then 
also @|b{> = |b/.>, and the two invariant states 


|teven> = ¢(|0>+O|b4>) = o(|64.> + [b£>) 
|Ueven> = c(|b4>+ O|bY>) = o(|bE> + b4>) 
are accidentally identical. This apparent incompleteness can always be 


removed by choosing #|6/,> and |b{> as basis, because then the invariant 
states 


(15.28) 


|Peven> = e(i|b,> + Gib.) = te(|b,> —|bf>) 
|Peven> = o(|bf>+ Olb4>) = e(|bE> + |b4>) 


are properly orthogonal. 
It is perhaps instructive to have the foregoing statements illustrated 
for the case j = 1 in which one has explicitly 


(15.28') 


(15.29) T(j = 1) = e'™: = [+iJ,sin7—J3(1—cos 7) 
0 Gk 
= 121 = (0 a7 0) 
1 0 0 


Taking as basis the eigenstates of Js, namely 


1 0 0 
(15.30) |1, +1) = (\): |1,0> = (\: [l,-1) = () 
0 0, 1 


one finds 


(15.31) 
91, +1) =|1,-1; — O|1,0> = —|1,0>; |, -1) = |1, + 
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and the accidental cases (15.23’) and (15.28) are realized. One chooses 
therefore as basis instead, the set with the phases 


i 0 0 
(15.30') 1, +1) = (\): [1,0> = () [1,-1> = (" 
0 0, 1 


which yield the three orthogonal invariant states 
|u(+1)> = (1/V2)(|1, +1>+O[1, +1) 


v 
= (1/V¥2)(|1, +1>-i]1, -1>) = val ) 


0 

(15.32) — |u(0)> = |1,0> = (' 
Ju(—1)> = (1/02) (1, - 1 +O|1, — 1) 

1 

= (1/V2)(|1, -1>—a|1, +1) = al’ 


The fact that the matrix 7’ is symmetrical for j = 1, as given in Eq. 
(15.29), is a special case of the general fact that 7 is symmetrical whenever 
e= +1. This follows from 0?=77*=I by multiplication with the 
transposed operator 7' = 7'** yielding 


(15.33) T7*T+** =T and T=T for e=+41 


by the unitary of 7’. Similarly, for the case «= —1, which will be taken 
up now, one obtains 


(15,34) T=-T for e=-1 
which means in this case 7' must be a skew-symmetrical matrix. 
(ti) «= —1. The object can be described in terms of an orthogonal set 


|b_> of eigenstates of a complete observable B, for example in terms of 
the eigenstates |j,m> of J* and Js, which are simultaneous eigenstates of 
2 


(15.35) 6b.) = -|b.». 


Tt follows again, as in the case « = +1, that |b)” = O|b_) is also eigen- 
state of @* with eigenvalue — 1. However, |b_>” is always orthogonal to 
|b->, because 
(15.36) 

<b_|Ob_> = <0*b_|Ob_> = —<b_|Ob_> (and therefore = 0) 
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where the first equality sign follows from <a|b> = (@b|@a> and the 
second from (15.35). 

Denoting the two orthogonal complements of any state |b_> by |b0> 
and |b") and defining them with a convention of phase so that 


(15.37) |b") = i@|b’> and |b’) = ** O|b"> = —10|b") 


one can thus say the eigenvectors of 9? are twofold degenerate, and the 
transition from the set |b“ to the set |b”> cannot be effected by a unitary 
transformation. 7'0 accommodate the reversality of an object one requires 
therefore, in the case « = —1, for its complete description an additional two 
dimensions in the abstract state vector space. One can satisfy Eqs. (15.37) 
by the 2n-dimensional representation 


(16.87') |B) = [b> (6) le) = jbo? x(?) |b” = TK|by 


where n is the dimension of the space spanned by the eigenvectors |b) of 
B, provided in reversality space, spanned by the vectors (5) and (?), the 
operator @ acquires an additional 2 x 2 matrix and is thus altogether of 
the form 

0 -I 
(15.38) Qa Taka Toe rx} i 
where 7 represents the operation ¢ > —t¢ applied to the components of 
|b> only. Indeed, 


iO|b> = apr(; =altal = ipye()= |b" 


-16|b2) = =i1>(7 Bre = -iiP>(j) = |). 


Operators that are even or odd under reversal of motion also acquire in 
reversality space an additional 2 x 2 matrix, which for hermitean opera- 
tors must be taken as the unit matrix, 


(15.37) 


(15.39) | A(even) = A (5 i B(odd) = Bx() - 


because then 
(15.40) 


@A(even)— A(even) 0 = ( : a 


@A-AO 0 
=0 provided @A—AO=0 
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and 


(15.41) @B(odd) + B(odd) @ = ( : Oe a) 


—(BO+@B) 0 
= 0 provided @OB+BO = 0. 
For the purpose of illustration consider the case j = 4 in which one has 
(15.42) 
T(j = 4) = &™» = Icos (m/2)+ 2tJ sin (n/2) = 2iJ = iz v 0) : 


Taking as basis the eigenstates of J, namely 
1 0 
(15.43) fox) = Hh +8=(9)5 |e» = Lb - 4 = (() 


one has the representations 


w>=()x()=(2); m= (°)x(2) = 


oocore 
oroso 


3 


= 
ris 
Vv 
I 
— 
or 
ae 
x 
ae 
=O 
Seer 
Il 
oo? o 
7 
ws 
Vv 
ll 
—— 
“So 
—— 
x 
—— 
eS 
= 
ll 
- ooo 


Wy ait 
ae oe ee el ee ee 
bei o*(: 0) = Oma OnmarO 

= i 


Transition to a different set of basis vectors |a_)>, say, can be carried out by writing the 
vectors |aj;_> as linear combinations 


(15.45) |a_> = x Valbr->+2 Wrylbx—> 


so that the set |a/_), if it is to satisfy (15.37), is accordingly expanded in the fashion 
(15.46) laj_> = 10|a,_> = “3 Wilde +E Vixlbe=>. 


Hence, in reversality space the transformation matrix S connecting the set 


((3s) 


({32)> so emt (32) = 8 (3) 


with the set 
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—w* ye 
In order that |a>, |a>” be orthonormal if |b>, |6>” are orthonormal, S must be unitary, 
(15.48) SSt=I- or 


(15.47) Sie ( is eli V and W n-dimensional matrices. 


VoOW\( V+ —wet) _ ( V¥++WwWwt -VW+wV \_ (1 oO 
—we ve)\wt yar —Wevt+vewt weer4 veer) = lo 1 
requiring 
(15.49) VV++WWt=I and VW=WYP. 


It is a well-known fact of algebra that if (15.49) is satisfied, S becomes a “symplectic” 


matrix, which means it leaves 7’ = (? ~}) invariant, in the sense ST'S = 7’. In other words, 
any other set |a>, |a>” can be obtained from the basis |b>, |b>” by a unitary symplectic 
transformation. 


Finally, since any even hermitean operator A = A*, @AQ™ = A satis- 
fies generally, because of @* = —IJ = —@~'@ and <a\b> = <0b|@a), 


(15.50) <b |AbL> = (@ABL|ObL> = <@AO- Ob! | Ob!) 
= (A@b.|ObL> = <AbNbY) = <b2| Ab") 


and an odd hermitean operator B = B+, @BO-1 = — B satisfies, by the 
same argument, 


(15.51) <0|BbL> = —<b"| Bb") 

it follows that in a state 

(15.52) |Beven—> = C([B>+b2>) = c(|b-> +10) 

the expectation value of any odd operator vanishes, and in a state 
(15.53) [boaa-> = o(|6) —|0>*) 


the expectation value of any even operator vanishes. 

The fact that, in the case where 0? = — J, a state and its time-reversed 
analog are always orthogonal—and that all hermitean operators, be 
they even or odd, are necessarily represented by diagonal matrices in 
reversality space according to (15.39)—can be summarized by saying a 
“superselection rule” holds: There are no observables which have matria 
elements connecting the orthogonal states |b! and |b"). 

Despite the impossibility of introducing the concept of “reversality” 
of a state in analogy to the concept of parity, it is, in principle, possible 
to introduce the concept of “relative reversality” of a state and its 
time-reversed analog, provided one can have composite physical objects 
made up out of an object and its time-reversed analog, so that the state 
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of the composite object is a direct product |c> = |b) x |b>”. Such a state 
can be an eigenstate of @ in the sense that if @|¢> = r|c>, where 7 is a 
number, then the definition of r is not affected by multiplication of |b> 
with a phase factor e' because, in that case, |b>” has to be multiplied 
with e~, Unless one can take serious certain speculations, touched upon 
in Section 17, which make the muon-neutrino the time-reversed analog 
of the ordinary neutrino, no such states |c> seem to be realized in nature. 
However, a similar situation arises with respect to the antiunitary 
operation of particle conjugation, making it possible, for example, to 
consider the positronium state as a direct product of a particle state and 
its particle conjugate state, giving rise to the concept of the “relative 
conjugality” of electron and positron, exhibited as ‘‘conjugality” of 
positronium, as will be explained in Section 28. 

It is interesting to speculate whether there is any physical attribute 
which removes the basic degeneracy of the states |b_> with respect to all 
other attributes B. Such an attribute might be the fermion number F, 
which has the value +1 for any fermion and the value — 1 for any anti- 
fermion, is additive, and is, by experimental evidence, strictly conserved 
in all known interactions. Thus if |b) represents a fermion state satis- 
fying 
(15.54) FibL> = +|b> 
then |b”> would represent an antifermion state, satisfying 
(15.55) F\b2> = —|b*. 


The operator / would act in the space spanned by the vectors |b> as 
identity and thus have in reversality space the representation 


(15.56) ee ee i) 


which guarantees that Eqs. (15.54) and (15.55) hold, and makes F an 
operator odd under time reversal, 


rl 90 IE “HG 
rfl HC gee 


Thus, if |b_> represents an electron characterized by the quantum num- 
bers (k, j = },m = +4), the time reversed state |b”> describes a positron 
having the quantum numbers (—k, j = }, m = —}). This is precisely the 
suggestion made by Feynman, who proposed to describe positrons as 


(15.57) FT_+7_F 
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electrons “running backwards in time.” If this point of view is adopted, 
then the strict conservation of / becomes a consequence of the super- 
selection rule which does not permit the transformation of a fermion state 
into an antifermion state and vice versa. 

Unfortunately, this point of view, though very attractive, is not more 
than a tentative hypothesis based on rather tenuous ground, because it 
is not at all certain that the degeneracy of states belonging to the 
quantum number @? = —1 is always the origin of the superselection 
rule leading to conservation of fermion number. In fact, there is yet 
another symmetry operation of apparent universal validity, namely 
the combined inversion 


(15.58) evade 


made up out of the unitary operation of coordinate inversion JT and the 
operator I which represents the conversion of particles into antiparticles 
and vice versa. In the case of fermions, + is antiunitary, as will be shown 
in Section 19, and therefore gives rise to yet another dichotomic quantum 
number, namely the eigenvalue of 2® which may be +1 or — 1, as in the 
case of the operation 6°, Thus, if a fermion belongs to the quantum 
number 2* = —1, yet another superselection rule would separate the 
halfspaces |b> and 2|b>, and could be made responsible for the strict 
conservation of yet another attribute which removes the degeneracy 
between the state and its combined inverted state. Altogether it should 
be possible to classify elementary particles into “types” which are 
labeled according to the values of eg = O® and ey = 5” as |b... .,>, and 
of which there must then exist four types, namely |b,,>, |b,->, |b, 
ee 

Experimentally, there is strong evidence that fermion number is 
separately conserved for baryons and for two kinds of leptons. One may 
speak of a law of conservation of baryon number B, of a law of con- 
servation of lepton number L, and of a law of conservation of muon 
number L,,. 

There are some reasons to believe that the conservation of the lepton 
number L, associated with the ordinary neutrinos (L = +1) and anti- 
neutrinos (L = —1) emitted in various 8 decays of nuclei, is a conse- 
quence of the superselection rule arising from the quantum number 
€y = — 1 attributed to neutrino and antineutrino (as well as to electron 
and positron), because the antineutrino state |») can be obtained from 
the neutrino state |y> by combined inversion, |y> = 2|v>, and similarly 
the positron state |e+> should be considered as obtained from the 
electron state |e~> by combined inversion, as will be shown in Sections 
17 and 19. Muon number L,, and baryon number B, on the other hand, 
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ean be consistently associated with the superselection rule flowing from 
time-reversal symmetry, as will be shown in Sections 17 and 29. 

In order that an attempt at classification of elementary particles into 
“types” can be made in proper detail, it is found convenient to develop 
first a formalism which allows the description of states with a variable 
number of particles present. 


NOTES 


Kramers [1] first noticed a peculiar twofold degeneracy of states 
describing an odd number of electrons in absence of external magnetic 
fields, which was recognized by Wigner [2], as a consequence of time- 
reversal symmetry in case of systems of half-odd integer angular 
momentum. This paper also marks the advent of anti-unitary symmetry 
operations in quantum mechanics. 

Zocher and Térdk [3] have given a review of the on first sight in- 
conspicuous consequences of time-reversal symmetry in classical 
physics. 

Wigner [4] has developed the formalism of anti-unitary operators in 
full generality, and has also adduced [5] experimental criteria which 
-permit to decide, in principle, whether a symmetry operation must be 
represented by a unitary or an anti-unitary operator in quantum 
mechanics. 

Feynman [6] has promulgated the view to look upon positrons as 
electrons ‘‘running backward in time.” 

Wick et al. [7] have coined the term ‘‘superselection rule” for situa- 
tions in which there are neither spontaneous transitions between states 
belonging to two subspaces, nor measurable quantities with finite 
matrix elements between these states. 

Wigner [8] has pointed out that one might be able to salvage, should 
the need arise, symmetry with respect to reversal of motion by introduc- 
tion of the concept of metamatter, distinct from antimatter. 
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SCION IG 


The Particle Concept in Quantum Mechanics 


The formalism of quantum mechanics lends itself naturally to an 
adaptation toward the description of “particles,” which are thought of 
as carriers of various observable attributes, to be elaborated in the 
following Sections 17 and 18. 

It should be understood that the particle concept in quantum mechan- 
ics is an abstraction rather far removed from the naive particle concept 
of classical physics, in which a particle may be visualized, with impunity, 
as a kind of small ball, capable of travelling as a sort of coherent body 
along a specifiable path in space and time. The very incompatibility of 
observable position and momentum has led in quantum mechanics to 
an erosion of such naive pictures, and the word “particle” stands for a 
quantum mechanical state characterized by a set of quantum numbers 
which are associated, in principle, with an identifiable event such as the 
momentum transfer in a “collision,”’ or with a sequence of events such 
as the vapor trail in a cloud chamber. 

This gain in abstraction, purchased with the loss of a naively satisfying 
picture, has brought under the domain of the quantum mechanical 
particle concept phenomena which earlier were thought,of as belonging 
to the classical field concept. Thus, large sections of the field dynamics 
of gases, fluids, and solids have been grasped quantum mechanically 
through introduction of the concept of the exciton, the phonon, the 
roton, etc., and it has become possible to describe many properties of 
macroscopic bodies with the same formalism that was introduced 
originally for the purpose of describing the so-called elementary particles, 
the fermions and the bosons. 

A remnant of the classical distinction between “actual” particles such 
as electrons, and “actual”’ fields such as the velocity field in a fluid, is 
the quantum mechanical distinction between particles and quasi 
particles. There is, at present, still a conceptual division between the 
“vacuum” from which the various elementary particles are thought to 
arise, and the ‘“‘quasi vacuum” from which the various quasi particles 
or excitons of solid, liquid, or gas state may arise. 

By generally accepted usage, the elementary particle vacuum is 

115 


116 Concepts in Quantum Mechanics 


imagined as a completely empty state. The quasi vacuum, in contrast, 
is always thought of as some kind of full state, either as the ground state 
of a lattice made up out of actually present atoms, or as the electronic 
Fermi sea without holes or gaps, etc. However, the formalism, to be 
developed later on, does not reflect this distinction between vacwum and 
quasi vacuum, particles and quasi particles. Both vacuum and quasi 
vacuum can always be represented by a state vector denoted |0>, 
indicating a state completely empty of particles and quasi particles, 
respectively. 

In this connection it seems worth recalling the curious historical fact 
that, at a certain stage in the development of the theory of elementary 
particles, Dirac found it convenient to introduce as the vacuum state 
for the electron-positron particle system a far from empty state, namely 
the state in which all negative energy levels were filled without holes or 
gaps, and which thus resembled what today would be called a quasi- 
vacuum state. That Dirac could do so without getting involved in serious 
inconsistencies is rather remarkable, and the formal reasons for Dirac’s 
alternative will be taken up in some detail in Section 17. 

In any case, the distinction between particles and quasi particles in 
recent years has shown a tendency to become blurred, and as a result 
there has been a vigorous cross-fertilization of elementary particle theory 
and theories of the solid, liquid, and gas states of matter. 

It is, of course, not intended here to deny by these remarks the useful- 
ness of the description of macroscopic bodies in terms of atoms making 
up, say, a solid lattice. After all, lattice atoms are rather manifest in 
experiments involving relatively high energies, such as in X-ray diffrac- 
tion patterns produced by solids, so that in this experimental sense one 
may say lattice atoms do, in fact, exist. 

Nevertheless, it seems by no means established that one cannot, in 
principle, account for all observations made on macroscopic bodies in 
terms of quasi particles. Should such a program turn out to be realizable, 
one can logically conceive a situation in which it might become feasible 
and even profitable to abandon the concept of atoms entirely and describe 
matter entirely in terms of what are now called quasi particles. 

In this vein, one can speculate about the possibility of developing a 
theory of elementary particles in terms of an underlying substratum or 
Urmaterie, which might play for the elementary particle vacuum the 
same role as that played by the atoms of macroscopic bodies for the 
quasi vacuum, which is a substratum from which the quasi particles arise. 
Dirac’s original notion regarding the electron-positron vacuum may 
conceivably have been abandoned too rashly. 

Returning now to the established knowledge about particles and quasi 
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particles, there is a categorical statement one can make about the 
occupation number of fermion or boson states which cannot be made, in 
the same categorical fashion, about quasi particles, as follows. 

Suppose one knows what constitutes a complete set of quantum 
numbers + for a physical object, so that specific values of + characterize 
the object completely. For example, in case of an elementary particle 
describable in terms of momentum k, spin s, and lepton number L, 7 may 
stand for the six quantum numbers (k,j,m,L). Since the notion of 
particle implies the existence of objects which can, in principle, be 
counted, one may infer the existence of occupation states |0,>, |1,>, 
|2,>, ... which are eigenstates of a suitably chosen operator representing 
the observable number of particles having property 7, so that these 
states describe situations in which, respectively, 0, 1, 2, ... particles of 
property 7 are present. 

For reasons only partly understood to date, all particles with half-odd 
integer spin, called fermions, satisfy the 


Exclusion Principle: The number of fermions NV, in a given, complete, 
quantum state 7 is restricted to either 0 or 1. 


The number 7, of particles with integer spin, called bosons, in a given 
quantum state 7 is apparently unrestricted, except that n, must be > 0. 

No categorical statement of this kind can be made for quasi particles. 
Restrictions on the occupation number of quasi-particle states, if any, 
will have to be stated separately for each case. 


NOTES 


Pauli [1] discovered the exclusion principle. Readers interested in the 
history of the profound contributions to quantum mechanics made by 
Pauli will find abundant food for thought in Fierz and Weisskopf [2]. 

The connection between validity of the exclusion principle and spin, 
which is difficult to understand, has been the subject of work by Pauli [3] 
and Liiders and Zumino [4]. 

Dirac [5] first conceived of the vacuum as a state which need not be 
empty. 

A more recent attempt to fashion a description of elementary particles 
after a quasi-particle model is contained in the work by Nambu and 
Jona-Lasinio [6]. An earlier, more naive, attempt by Kaempffer [7] 
turned out to be abortive. 
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—$ 5 $$ SECTION 17 


Fermion States 


The exclusion principle implies that the number of fermions NV, in a 
given complete quantum state 7 is a dichotomic variable. The only 
existing occupation states are |0,> and |1,>, which may be represented 
in a two-dimensional abstract space as 


ony ro=()5 Ho =(), 


These state vectors form an orthonormal set. A suitable operator 
representing the observable fermion number N(z) is obviously, in this 
space, 


0 0 
(17.2) Nae (; :) 
since it has the required property 
(17.3) N(z)|0,> = 0,|0,>; -N(r)|1,> = 1,1. 


A particularly useful concept is that of the annihilation operator a(r) 
and the creation operator a*(r) which connect the states |0,> and |1,> 
according to 


(17.4) a(7)[1,> = ¢,|0,>;  a*(r)|0,> = eF|1,> 


where ¢, is some complex number. Consistency with the exclusion 
principle requires further the relations 


(17.5) a(7)|0,> = 0; at(r)|1,) = 0. 
A suitable set of such operators is 

ol 0 0 
(17.6) ae e,( . oi at(r) = et(; o 


which justifies the notation adopted in (17.4) making a+(r) the hermitean 
conjugate of a(r). The requirement that successive application of one 
creation operator and one annihilation operator in either order should 
restore the original state subjects the factor c, to the condition 
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(17.7) e,|2=1 

so that indeed 

(17.8) 
a*(r)a(r)|1,> = ¢,a*(r)|0,> = |e,||1,> = |1,>; ete. 

With this choice of c, one finds the operators a(r) and a*(r) connected 


with N(r) by the equations 


at(ryate) = lee*(5 4), = ers 


Omen 
(17.9) 
+ 2(1 0 
a(r)a*(r) = |c,| = I(r)—N(r) 
0 1), 
where I(r) stands for the identity operator 
L 40 
(17.10) ene ( i). 
The two equations (17.9) added yield the anticommutation relation 
(17.11) a(r)a*(r) +at(r)a(r) = I(r). 
The description can now be extended to envelop all quantum states 
T1, Tz; +++) Tms ++» accessible to the fermions under consideration. The 


occupation state of a fermion system is defined in the product space 
spanned by the occupation states for the various quantum states as 


(17.12) 
[Noy Nag eee Nagsee> = [W>x [NX 20% [Nad Xoo 

For convenience it will now be agreed to write the occupation numbers 
in a certain order, corresponding to a certain sequence of labeling the 
quantum states 7,,. Thus 7,, will always be written to the left of +, if 
m <n. Any physically observable effect must, of course, be unaffected 
by any change in convention. 

An important special occupation state is the vacuum state defined by 


(17.18) ]O> = Op, Opy5 ++ +s Ops e+ o> 


which describes, thus, a completely empty state. To this state there 
exists, at least for fermions, an interesting counterpart, namely the full 
state defined by 


(17.14) peel Blea lee Slee 


It should now be possible to construct creation operators a*(r,,) which 
raise fermions from vacuum into the various quantum states 7,,, 
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(17.15) CaXGe)| Rock! esersac ates Ane ene 


and annihilation operators a(r,,) which remove fermions from the full 
state of the various quantum numbers r,,, 


(17.16) (ra) [glenn = ode Or ay 


In this fashion any occupation state (17.12) can either be raised from 
vacuum by successive application of the various creation operators or be 
obtained from the full state by successive application of the appropriate 
annihilation operators. 

By suitable choice of the factors c,,, one can represent, in the product 
space (17.12), the operators a*(7,,) and a(7,,) by matrices satisfying the 
anti-C.R.s 


(17.17) 
{4(Tm) UT )} = {4* (Tm) O*(Tm)} = 03 {A(T p) O*(Ty:)} = I(t) Sam 


where {ab} = ab + ba, and which contain relation (17.11) as a special case. 
This is accomplished by putting 


m—1 
(17.18) ¢,, = |] (—-2N,,) 

n=1 
so that condition (17.7) is again satisfied, c,,, being + 1 or — 1 depending 
on whether the number of occupied states V,,, = 1 with n < mis even or 
odd, because the occupied states each contribute the factor —1 to 
(17.18) whereas the empty states each contribute the factor +1. By 
noting the identity 


(17.19) H(rq)-2N(09) = _y) = Talend) (ony) 


one can write down immediately the matrix representation for a(r,,) 


anda*(7,,), 
(0 a) *(0 o),.*(0 9). 


(17,20) 
1 0 
(Tm) = ie ot a 
Ps 
may) atieg) = TE talrw (7 6) TT te) 
n=l n= 

the understanding being that a matrix labeled 7, lea solely on the 
subspace belonging to the quantum state 7, in the general product space 
(17.12). 

The representation used here has the interesting property of being 
invariant under a unitary transformation 
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U=U=() ) x 


which, in effect, interchanges the roles played by occupied quantum 
states and empty quantum states or “holes.” Indeed, for any n, 


(17.23) 
V'@) = UN() Ut = (; \(° He ) a ( °) = Ie)—Nte) 


x(? 0), =[TT hi) — (ay) 


n=1 


1 O/\O T/\1 0 0 0 
so that 
(17.24) Uloy =|; UI1y = |oy 
and 
(17.25) 
(tT) = Ualt,) 0+ = (—1)™1 a(t); a" (tm) = (—1)™ a(7%). 
Equations (17.25) can be written explicitly 
(17.26) 
wea =e() g) sera = oR(5 gh 
with 
(17.27) 


m—1 m=1 
(ie = U (1-27) = ca = (= y—tores 


It is this peculiar symmetry between occupied states raised from vacuum 
and corresponding holes in the full state which enabled Dirac to consider 
positrons as holes in a filled “sea” of negative energy electrons. 

For application to the theory of quasi particles it is often convenient 
to perform a unitary transformation U(z,,) in which, for some of the 
quantum states only, namely those belonging to 7, with n < m, the roles 
of occupied and empty states are interchanged. The corresponding 
operator is then 


(17.28) 


tare trea (6 Drala oe 


As a first example, consider a fermion in a simultaneous eigenstate of 
momentum k and spin quantum number s= +1 or s = —1, denoted 
|k,s>, which is obtained from vacuum by application of the creation 
operator a*(k,s), so that, including the two-dimensional occupation 
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space, the total state vector of such a single fermion state reads explicitly 
(17.29) |lk,.> = a*(k,s)|0> = |k,s> x()- 


The simplest particles that can be described completely by their momen- 
tum k and spin s are the neutrino and the antineutrino, which possess 
no mass so that energy 2 and momentum k satisfy 


(17.30) Q?-k? = 0 
or, if one requires energy to be a positive number, 
(17.31) Q = |ki. 


The neutrinos and antineutrinos arising in the 8 decay of nuclei differ 
experimentally in that the spin of the neutrino is always aligned parallel 
with a direction opposite to its momentum (spin and momentum are 
“antiparallel”’): the neutrino is left-handed, 


(17.32) (s-k) = —k = —|k|, for neutrino, 
whereas the antineutrino is right-handed, in the sense 
(17.33) (s-k) = +[kl, for antineutrino. 
Equation (17.31) can therefore be written 
—(s-k) for neutrino 
(17.34) Q= 
+(s:k) for antineutrino 


Accordingly, suitable Hamiltonians for the description of v and # should 
be 


(17.35) H, = —(o°P); H; = +(o-P), 
with the understanding that, upon resolution into components in spin 
space, 
(17.36) 
H|k,s> = +(o-P)|k,s> = |k||k,s> means 
#( Ps pie | mk) rs Ne (e222) 
P,+iP, —Ps |K, 32 |k, 8>e 


In coordinate representation, the eigenstate |k,s> can be character- 
ized by two-component # functions [see Eq. (9.30)] 


(17.37) <a(t)|k,8> = Wy, 6(q,t) = ety, .(q) 
so that 
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(17.38) |k,3> = f |a(t)> %,.(4,¢) dq. 


By the same reasoning as the one employed in the derivation of (9.30) 
from (9.20), these functions must satisfy, if one uses P = —1(0/0Q) 
and H = +%(@/dt) and denotes the functions for neutrino and anti- 
neutrino by ¢ and 7, respectively, 

(17.39) 


H,4$(q) = +i[o(0/q)]$(4) 
Pa +i(( (0/043) hide saad 9 i (3) 


/0q1) + 1(0/092) — (0/093) de, 
(17.40) 
H;7(q) = —t[o(0/0q)}7(q) 
/ a 0/8q1) —1(0/0 7 7 
(reas ioreae) tala) Yew) = ~¥1(%) 
The solutions of (17.39) can be obtained by writing 
(17.41) $n(@) = Ane 


with amplitudes to be determined from the linear equations 
+ksA,+(ky—tkg) Ay = —|k|Ay 
+(ky+ike) A;—ky Ay = —|kl A, 


and the normalization condition 


(17.43) J o%a) #(a)dq = (|4s|*+]42/*) V = 
} 


(17.42) 


The necessary and sufficient condition for existence of a nontrivial 
solution of (17.42) is the vanishing of the coefficient determinant 


(17.44) 
t+hg+|k| + (ki—tke) 
+(ky+tks)  —kg+|k| 


which is identically true and affirms that the solutions ¢4(q,,) are indeed 
of the form (17.41). One obtains thus, with suitable phase convention, 


(17.45) 
A, = 


= k*-/2-i8-18 = 0 


(ky — tke) rate (|k| +s) 
VVV2IEI(e| +h) | * VV BRIE] + bs) 


These amplitudes show dramatically the correlation between directions 
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of spin and momentum of the neutrino if they are written in terms of the 
polar angles 9, describing the direction of k, k, = |k|sindcos 9, 
ky = |k|sindsin 9, ks = |k| cos?d, so that 
(17.46) 

singe? 


gaa 1 
; VVV%X1+cos8) VV 


sin (9/2) e"? ; 


Aisies apo (6/2) 


because now the function 


ar) #0) = 7p( “sre J 


contains the spin state (2.42) describing a spin — 1 in direction #, @ of the 
momentum. 
The solutions of (17.40) are obtained by writing in the same fashion 


(17.48) m(q) = B,e™ 
yielding, with suitable convention of phase, 
(17.49) 
B (|k| +s) (ky + tke) 


ee ee ee 
VVV2Ik| (kl +s) 2 VS VV 2k] [| +s) 


so that in polar coordinates 


(17.50) ; ( sontes2) Jes 


7(@) = 7\sin (0/2) et? 


the antineutrino function 7 contains the spin state (2.41) describing the 
spin +1 in direction #, 9 of the momentum, as expected. 

Neutrinos and antineutrinos have an attribute, called the lepton 
number L, which is strictly conserved in all transitions involving creation 
or annihilation of these and other particles classified as leptons. By 
convention, one attributes the value L = +1 to neutrino », electron e-, 
and muon p-, and the value L = —1 to antineutrino 7, antielectron 
(or positron) e*, and antimuon *. Since no exception to this apparent 
conservation law is known, one is tempted to interpret it as consequence 
of some superselection rule associated with the existence of some anti- 
unitary symmetry operator whose square is —J. 

One such symmetry operator is the operator of time reversal @ which 
according to the development of Section 15 has indeed the property 
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©? = —I for leptons of spin } provided they possess no intrinsic attri- 
butes other than spin which might invalidate this property. However, 
this operator is not a possible candidate for generation of the desired 
superselection rule preventing any neutrino state going over into an 
antineutrino state and vice versa, because the operation of time reversal, 
changing the sign of both spin and momentum, does not change the 
handedness. The very existence of neutrinos and antineutrinos with 
definite but opposite handedness forces one, then, to draw a far-reaching 
conclusion regarding Feynman’s suggestion to consider antiparticles 
as particles “running backwards in time”: This suggestion cannot be 
adopted to distinguish formally the neutrinos and antineutrinos emitted 
in various 8 decays. 

Fortunately, there is an operator which changes the handedness of a 
particle, namely the operator of coordinate inversion IT, changing the 
sign of momentum only and allowing it therefore to be represented in 
spin space by the identity, so that altogether 


(17.51) W=IxIp 


where IT, operates on the dynamical variables such as P, H and repre- 
sents the operation q—> —q in any ¢ function. In accord with a remark 
made at the end of Section 14, I7p is by convention assumed to be real, 
so that 17} =I. 

As it stands, JT does not represent a symmetry operation on either 
neutrino or antineutrino states, because of the definite handedness of 
these particles, so that applied to a neutrino state, for example, [7 pro- 
duces a state not realized in nature, One can, however, repair this defect 
by introducing an operator of “particle conjugation” I" which, applied 
to any neutrino state, converts it into the corresponding antineutrino 
state without affecting the dynamical attributes, so that the operator of 
“combined inversion”’ (in the older literature often labeled PC) 


(17.52) == 07 


is now a symmetry operator of the neutrino-antineutrino system. 
Moreover, if I’ is assumed to be antiunitary with 


(17.53) r2= -I 


then one can treat both neutrino and antineutrino so that, for these particles, 
conservation of lepton number L becomes the consequence of a superselection 
rule separating the two halves of the inversion space spanned by the states 
|v> and |7> = Z|v>. 

A representation of I’ may be obtained by considering the merger of 
the two two-dimensional spaces spanned by the eigenfunctions of 
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— (oP) and (oP), so that a neutrino-antineutrino system can be described 
by a four-component # function 


(17.54) wa) = (19) 
In this space, I’ can be represented by 

0 il 
(17.55) r=(_} ok 


and IT by the identity, together giving for the operator of inversion the 
representation 


uy a! 
(17.56) se (ext 0) ok. 
One has then indeed 
(17.57) x? = —I, 
The lepton number L takes in this space the form 
1 0 
(17.58) Tin (0 > 
and one can write the expressions (17.35) as a single Hamiltonian 
ath _ j—{oF) 8 ) 
(17.59) = aia ( on 
satisfying 
(17.60) Ap(q) = —|k| $(q). 


These considerations do not prove that the operation of particle 
conjugation I’ must be antiunitary for neutrinos. Such a proof will be 
given later in Section 19 for electrons and positrons. All that has been 
shown thus far is the consistency of this assumption with the point of 
view from which the antineutrino is revealed as a neutrino ‘seen 
through a mirror.” Furthermore, these considerations do not prove that 
a particle corresponding to a time-reversed left-handed neutrino cannot 
exist in nature. All one can say is that such a particle, if it exists, cannot 
be identified with the right-handed antineutrino emitted in the 8 decay 
of nuclei. 

There is mounting experimental evidence for the existence of another 
kind of neutrino, the so-called muon-neutrino Mp and its antiparticle Pus 
having the same handedness as v and 7 respectively, and being emitted 
in reactions involving muons, for example, 


at —> pty; Te thy; 


pret toty; po ety, th. 
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Apparently, there is a second lepton number L,, (the ‘‘muon number’’), 
which is additive and conserved independently of L, accounting for the 
absence of processes such as 


eRe +y; pe o+ptnte. 
If the assignments of lepton numbers laid down in Table 17.1 are 
adopted, then the transition »~—e~+v,+¥, for example, will be 
allowed because the lepton numbers on both sides of the reaction are 
L=+1= +1+1—land L,=—-1= +1-1-1, 
TABLE 17.1 


AssIGNMENT oF Lepton NUMBERS TO THE VARIOUS Exist1InG LepTons 


pt Me et e v v Vp Mp 
L -1 +1 -1 +1 -1 +1 -1 +1 
iy f=) Sa ee 


It is this muon-neutrino v,, which can be accommodated by Feynman’s 
suggestion, if one adjoins the reversality space, so that, for example, 
after the fashion of (15.37) the muon-neutrino state is obtained by time 
reversal from the neutrino state, and the antimuon-neutrino state by 


time reversal from the antineutrino state, 
(17.61) > = 1O|v>; |» = —i0|v,> 


making the muon-neutrino a left-handed particle in accordance with 
observations carried out on the decay 7+—>p* +v,. 

The state |¥,,> will again result from application of the operation of 
combined inversion Z to the state |v,,>, and conservation of L among 
the muon-neutrinos will be guaranteed again because of 2° = —J. The 
conservation of L,,, on the other hand, will follow from the superselection 
rule owing to the property @® = —J of the antiunitary time reversal 
operation 0. 

This discussion raises a number of seemingly perplexing questions 
regarding the transformation properties of electron, muon, and nucleon 
states under time reversal @ and under combined inversion 2. If the 
assignment of lepton numbers L and L, proposed above is adopted, 
consistency would require one to consider positrons as spatial inverses 
of electrons, contrary to Feynman’s original intention, and only muons 
could possibly be identified with time-reversed electrons. The already 
baffling mass difference between muon and electron would, in this case, 
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acquire an additional degree of mystery through the necessary trans- 
formation m, = Om, @-!. The existence of only one baryon number 
assignable to nucleons and antinucleons, on the other hand, would 
suggest that for nucleons only one antiunitary symmetry operation has 
the property that its square is equal to —J. This turns out to be possible 
because of intrinsic attributes other than spin shared by baryons. In 
Section 29 reasons will be given for the suggestion that the conservation 
of baryon number B is indeed a superselection rule following from 
invariance under time reversal. 


NOTES 


Jordan and Wigner [1] invented the representation of creation and 
annihilation operators for fermions. 

Dirac [2] has pointed out the possibility of interchanging the concepts 
of “occupied” and ‘‘ unoccupied” fermion states. 

Wey] [3] invented the two-component equation which was employed 
to describe the neutrino by Lee and Yang [4]. This equation had been 
rejected, prior to the discovery of nonconservation of parity in weak 
interactions, by Pauli [5], on the grounds that it violated the reflection 
symmetry of nature. The neutrino had first been proposed as a hypo- 
thetical particle by Pauli [6]. The name “neutrino” is apparently due 
to Fermi. 

Konopinski and Mahmoud [7] first suggested the existence of a law 
of conservation of leptons, (See also Lee and Yang [4].) 

For assignment of the second lepton number L,, see, for example, 
Horn [8]. 
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Boson States 


Since the occupation number of a boson state characterized by the 
set of complete quantum numbers 7 is unlimited, the eigenstates of the 
boson number n(r) will span a denumerably infinite-dimensional space 
and may be taken to be represented by the orthonormal set 


il 0 
0 il 
(18.1) Joo =| 9]; [Ip =] Oy; -..5 
0 
oe components 0 
jn =| 1 
0 


In this space the operator representing the observable boson number 
n(7) is then diagonal and reads 


000 
010 
(18.2) vite) ta (Cian 


7 
A suitable set of operators representing, respectively, annihilation and 
creation of a boson in quantum state r is 

0 VI 0.0 

0 0 V2 oO 

OO 0y4/3 
(18.3) r)=| 0 0 0 0 
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0 0 0 0 
vl 0 00 
0 72" 0N0 
bt (7) = 0 oO V3 0 
T 
having the property 
(18.4) B(r)|n,> = Vn,|n,—L; bt(r) |[n,> = Vn, +1|n, +1). 


The normalization implied by (18.4) insures that any state with n, < 0 
is identically zero, and that the operator n(r) is connected with (7) and 
b*(7) by relations analogous to (17.9), namely 


(18.5) bt (7) b(t) = n(r); B(x) b*(r) = I(r) +n(r) 
which may be subtracted to yield the C.R. 
(18.6) b(r) b+(7) —b*(r) (7) = I(r) 


The treatment can now be extended to envelop all quantum states 
T1, T2) +++) Tm, «+. accessible to the bosons under consideration. The 
occupation state of a boson system is defined in the product space 
spanned by the occupation states for the various quantum states as 


(18.7) [tps Magy sees Mayes) = |My.) X |Mp,> X22 [Meg Kee 


It is easy to construct creation operators b*(r,,) which raise, from any 
state, another boson into the quantum state r,,, thus 


(18.8) BF (Ty) [+2 Meg se = Veg FL |. - Ie, + 1...> 


and similarly annihilation operators b(r,,) having the property 


(18.9) Bn) [+50 Repos = Ve, coe Leeds 
They are simply represented by 
0 vi 
V2 2 
(18.10) B(t,) = TT I(t) x : rae 24e5) 
n=m+1 


™ 
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0 
m—1 vi a C 
(18.11) b*(rm) = [] (tn) x V2 x TI Lr) 
=m + 


n=1 + nm 


T™ 


which satisfy the C.R.s, as is easily verified, 


(18.12) 
[B(tm) O(n) = (O*(4m) OF (tm)] = 9; (B(m) OF (Tm) = L(t m) Sm 


where [ab] = ab—ba. Equations (18.12) contain (18.6) as a special case. 

A very simple kind of boson is the photon, which, like the neutrino 
and antineutrino, has no mass and no internal dynamical attributes 
except momentum x and an abstract dichotomic polarization variable S, 
which will be labeled Ss = +1 for right-handed and S, = —1 for left- 
handed circular polarization in the direction of x. The photon differs 
fundamentally from those fermions, however, because it belongs to the 
eigenstates |j,m> of angular momentum with integer j. There is no 
meaningful distinction between a photon and its antiparticle; photons 
are observed in all linear combinations of right-handed and left-handed 
polarization. 

Experimentally, momentum x and polarization S, of a photon are 
compatible observables, and there should thus exist a representation in 
which a right-hand circular polarized photon is raised from vacuum by 
application of an operator b+(x%, R) 


(18.13) [lye.n> = Bt(,.R) |0> = |x, RD x 


and in which a left-hand cireular polarized photon is raised from vacuum 
by application of b+(x, Z), 


(18,14) |1y,2> = bt (x, L)|0> = |x, L> x 


where |x, R> and |x, > are the simultaneous eigenstates of momentum 
P and polarization S, so that they may be decomposed into 


(18.15) |x, R> = |x, +1>x |B; |x, L> = |x, —1> x |Z> 
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where | R> and | L> are the two-dimensional eigenstates of the dichotomic 
polarization Sy, which, according to the general development of Sections 
1 and 2, may be written 


(18.16) |R>= (a) |L> = ie Ss = F me) 


and where |x, + 1 are the eigenstates of momentum P satisfying 
(18.17) Pix, t1> = x|x, +). 


Since the photon has no mass, the energy w is, as in the case of the mass- 
less fermions, 


(18.18) o= |>| 

and since its spin j = 1 is aligned parallel to x for S;= +1 and anti- 
parallel to x for S; = —1, the vectors |x, + 1> must be eigenstates of the 
Hamiltonians 

(18.19) 

H,=+(3P) for Ss= +1 i.e. (JP) |x, +1> = |x| |x, +1> 
(18.20) 

H_=-(JP) for 9; 1 ie. (JP) |x, —1> = |x| |x, —b, 


respectively, where J are the component operators of angular momentum 
for j = 1, namely (see Appendix 1) 


1 (2 1 0 , (9 -& 0 
cbain) Beeman i(h ajeriilpe ye eel (S0 eM) elle 
V2\0 1 0 V2\o ¢ 0, 


He Op eit) 
Jg=(0 0 0}. 
ty aa 


In the space spanned by the eigenvectors of S3 (18.16) one can thus 
write the Hamiltonian 


(18.22) le ae “a = 55x (JP). 


Applied to a general photon state of momentum x, which is a linear 
combination 


(18.23) 
|x, S> = a(S)|x, R>+A(S)|x,L>; — |a|*+|B|? = 1, 
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the Hamiltonian then always has the eigenvalue (18.18), 
(18.24) H|x,S> = w|x,S>. 


It is firmly understood that the energy w is never anything but a positive 
number. 

An eigenstate of momentum P can, according to an insight gained in 
Section 14, never be a state of definite parity. It is intuitively clear that 
the operation of inversion of coordinates I7 will make a photon of 
momentum x and polarization Ss appear to look like a photon of momen- 
tum —x and polarization —S3, the mirror image of a right-handed screw 
being a left-handed screw, and vice versa. In the space (18.16) the operator 
of inversion is accordingly of the form 


(18.25) ned 


so that |R> and |Z) are converted into each other and Sg is odd under 
inversions, 


(18.26) 
Ts|R> = |L); Ts|L> = |R); TsS3+S3 IIs = 0. 
In the total space (18.15) one has then the representation 


(18.27) Tee ey a) 
where IT, operates on the dynamical variables according to 
(18,28) Wy dITp = J; pPH;' = —P, 
leaving thus the Hamiltonian (18.22) invariant, 
(18.29) 
0 —[UTpVUP) + (JP) IT, 
MH-HH=(pyeyp\tj] 0) TO 


as is required by the inversion symmetry of the photon. 

The operation of reversal of motion, on the other hand, will make a 
photon of momentum x and polarization Ss appear to look like a photon 
of momentum —x and polarization Ss, because the handedness remains 
unchanged if both P and J transform under @> as 
(18.30) 0,JOz' = —J; @,PO>' = —P. 


In polarization space the operator © can accordingly be represented by 


(18.31) Os = F 1) 
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so that altogether 


0, 0 
0 @, 


where 7' has in the spin space spanned by J, if they are represented by 
(18.21), according to (15.29) the form 


0 OD 
(18.33) T=({0 —l1 O}. 
1 0 0 


The Hamiltonian (18.22) is evidently invariant under 0, and the 
symmetry of the photon under reversal of motion is thus guaranteed. The 
question of the transformation properties of photon states under 
particle-antiparticle conjugation cannot be answered except in conjunc- 
tion with the transformation properties of fermions that are coupled to 
the electromagnetic field. This will be taken up in Section 28. For the 
purpose of the present section, it suffices to record that the operator of 
particle conjugation carried out twice, I, applied to a photon state is 
equivalent to the identity, I = J. Since also @* = +J for photons, the 
photon is of “type” |b,+, in the sense explained at the end of Section 15, 
and no superselection rules are involved in the creation and annihilation 
of photons. 

The solution of Eq. (18.24) is most conveniently carried out in momen- 
tum representation. Writing the six-component state vector in the 
combined spin-polarization space as 


a(S) x® 
cas = (BN 


(18.32) @ = 6x65 = ( iE 6, = TK; 


the two three-component vectors 


will be subject to the normalization conditions 

(18.35) 

ye =i 1; x’? x¥2=1  sothat <x, S|x,S> = |o|*+|B]? = 1. 
Since x” and x are eigenstates of S, with different eigenvalues, they will 


satisfy the orthogonality relation 


(18.36) 
<x, Bln, Ly = xx" = x" x* = (x, Lx, R> = 0. 


18. Boson States 137 


Equation (18.24) reads now, in components, with the representations 
(18.21) and (18.22), and P = x, 


(18.37) 
tg xf + (1/V'2) (1c ~ ten) xB = oxf 
(1/2) (1 + 4x2) xi? + (1/2) (11 — tg) xg = ws? 
(1/2) (1 + x2) xB — ey x$ = ox 


and a similar set for y{, 4, v% in which the signs of all terms on the 
left-hand side are reversed. In order that there exist nontrivial solutions 
the coefficient determinant must vanish, 


(18.38) 
tKky—w + (1/2) (11 — tea) 0 
+ (1/2) (1 + ix) -—w + (1/2) («1 — ax) 
0 + (1/2) (x1 + ix.) ¥ kg— 


= w(K? +13+K3—w*) = 0 
which is obviously satisfied for any w 4 0 because of (18.18), One finds 
without difficulty the solutions 


(18.39R) 
pm el EE, gs (aM, 
a ame) w—Kks” xf : Qn? ” 
Tipe ees Kg 
Xs w+ks 
and 
(18.39L) 
aly sleet feats —Ks, ry as (PEA, 
xi = w+ks’ ee Dw? * 
je oar (oo 
xs 20 wW—Ks 


In terms of polar coordinates @,p characterizing the direction of 
propagation x, so that «, = wsin#cos 9, co = wsindsin ¢, Ks = wcos#, one 
finds, as expected, that »" represents a spin state 


i (1+ cos) e-*? 
(18,40) Te se V2sind 
(1— cos #) ef? 
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describing the spin j = 1 aligned parallel to x, and x” represents a spin 
state 


y= (1—cos#) e“*? 
(18.41) xe = 5 V2sind 
—(1+cos#) ef? 
describing the spin j = 1 aligned antiparallel to x, because these states 
are eigenstates of 


(18.42) Jp,9 = J18indcos 9 +Jesind sin 9 +J3 cos) 
1 (‘inser sine? 0 ) 
0, 


=a sin) e'? 0 sin pe“? 

ms 0 sinde'? —+/2cos 
with eigenvalues +1 and —1, respectively, as seen by straightforward 
computation. The operator Js. has another normalized eigenstate, 
namely 


1 —sinpe? 
(18.43) yaw Wa V2cosb 
sin § e!? 
belonging to eigenvalue 0. Although this state cannot represent the spin 
state of an actual photon, since this would require existence of longi- 
tudinal polarization states |x,0> (say), it can be used to express the 
transversality of actual photons in the compact form 
(18,44) 
y*y® = 0; xy” =0 sothat <x, 0|x,S> = 0. 
Changing the polarization S of a photon state of given momentum x 
means changing the amplitudes «(S) and A(S) introduced in (18.23), One 
can abstractly describe such changes in polarization as rotations in the 
polarization space spanned by the eigenvectors of Ss. The generators of 
such rotations are, for given x, the operators 
S, = bt(R)b(L) +b*(L) b(R) 
— ib*(R) b(L) +ib*(L) b(R) 
Ss = b*(R)b(R)—b*(L)b(L) = n(R)—n(L) 
So = b*(R)b(R)+0*(L)b(L) = n(R) +n(L) 
which satisfy, as a consequence of the C.R.s (18.12) for the b* and 6, the 
C.R.s 


2 
ll 


(18.45) 
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(18.46) 

[Si1Ss] = 2185; [S283] = 208,; [S381] = 2082; [S,So] = 0 
(k = 1,2,3) 

and also 

(18.47) S82 +S8$+82 = 8o(So+2). 


They are thus, according to (2.46), isomorphic to the Pauli matrices 
(2.43) and may be represented in polarization space by 


(18.48) 


01 , _ (9 —#), iil fae 0 
ale PN gir ye? (os) 


This explains, after the event, the notation S, for the polarization 
operator. 

Generalization to include states containing ny; photons, each of 
momentum x and polarization S, proceeds without difficulty. The most 
general photon state |...n,,5...> will have to be a simultaneous eigen- 
state of the Hamiltonian 


(18.49) H=> Dy H(x, 8) b* (x, S) (x, 8) 


and the polarization operator 


(18.50) 
Ss = DASs(x) = D [b*(x, R) b(x, R) —b*(x, L) b(x, LY] 


having the eigenvalues 


(18.51) W =X Dony,s 
x 8 
and 
(18.52) S3 = L (M%,n— Mx, 1): 


* 


Because of the isomorphism of the polarization operators (18.48) with 
the operators representing an angular momentum j = 4, the construction 
of many-photon states from single-photon states in polarization space 
has to be carried out under the observance of the rules governing the 
addition of angular momenta, as given in Appendix 2. Specific examples 
will be treated in Sections 27 and 28, 

Instead of using the quantum numbers x, 8, to describe single photons, 
as has been done here, one can alternatively use a representation in 
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terms of the quantum numbers w, j, m, P (energy, angular momentum, 
parity). The ensuing formalism, although widely employed, is less 
transparent, and has therefore been relegated to Appendix 3. 

At this juncture the correspondence of the quantum-mechanical 
description of photons with the classical description of the same pheno- 
menon in terms of transverse electromagnetic fields E, B derivable from 
a vector potential A by differentiations, 


(18.53) E = —(aA/at); B=VxA 


can be established in a rather straightforward manner, if one stipulates 
that the energy W should correspond to the value of the integral 


(18.54) W = af (E* + B*)dq 


in the sense that the expectation value of H in a given state should be 
numerically equal to the value of W computed with the classical fields 
describing the same physical situation. 

This correspondence was traced by the originators of quantum electro- 
dynamics in opposite direction, when they were groping for suitable 
expressions which might serve as operators representing the classical 
fields of electrodynamics. As it turns out, the fields E and B, which prove 
so useful in macroscopic situations involving averages over many 
photons, are singularly unsuited for grasping elementary processes 
involving single photons. The purpose of the following considerations is 
thus not aimed at drawing comfort from establishing contact with 
classical electrodynamics. The theory of photons, laid out in this section, 
can stand on its own phenomenological feet. The aim is rather to exhibit 
the profound differences between classical fields and the quantum 
mechanical operators that must serve in their place. 

The polarization of transverse fields propagating in direction x can be 
conveniently specified by introduction of two real orthogonal unit 
vectors (1) and (2) which point in the direction of the intersections of 
the horizon of x with the equatorial (x,y) plane and the (z,x) plane, 
respectively, as indicated in Fig. 18.1. Since the horizon for any zenith 
intersects the equator at the east-west line,* their components are 


(18.55) e,(1) = sing; €,(1) = —cos9; eg(1) = 0 
(18,56) 
€,(2) = cos cos 9; €,(2) = cos#sin 9; €3(2) = —sind 


* The author is indebted to Dr. Luis de Sobrino, former Lieutenant in the Spanish 
Navy, for illumination on this point. 
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where # and ¢ are the polar angles describing the direction of the pro- 
pagation vector according to 


(18.57) 
nN, = (k;/w) = sin#cos 9; No = (ky/w) = sindsin 9; 
Ng = (ks/w) = cosd. 


Fig. 18.1. The polarization vectors €(1) and €(2) characterizing transverse fields pro- 
pagating in direction x. 

For the purpose of specifying circular polarization, one may introduce 
alternatively the complex orthogonal unit vectors 


(18.58) e(R) = (1/V2) [e(1) + i€(2)] 
(18.59) e(L) = (1/2) [e(1) —ie(2)] = e*(R) 
satisfying the orthonormality relations 

(18.60) e*(R)e(R) = e*(L)e(L) = 1 
and 

(18.61) e*(R)e(L) = e*(L)e(R) = 0 


as well as the transversality conditions 
(18.62) ne(R) = 0; ne(L) = 0. 
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Equations (18.60)-(18.62) are formally reminiscent of the ortho- 
normality relations (18.35), (18.36), and the transversality condition 
(18.44). This suggests strongly that one identify the components of the 
complex state vectors x", x”, and y° with appropriately chosen linear 
combinations of the vectors e(), e(Z), and n, respectively. By the substi- 
tution of (18.55), (18.56), and (18.57) for the components in (18.40), 
(18.41), and (18,43) one finds without difficulty 


(18.63) 
2 I (yi) i 1 (“vec 
==; 2e(R) |; xX =— 2eg(L) Ji 
V2\ oR) +ied(R) V2N\ (1) +ied(L) 
l ( vin” 
ye =— 2n. 
v2 eens 


With some hindsight, this correspondence between spin states and 
polarization vectors can be made more obvious, if one performs in spin 
space the unitary transformation 


(18.64) 


gafete oy 2 ,(-h i 0 
U=—|-i 0 -it); Ut =—-| 0 0 v3) 
vil 0 v2 ' al eG 


leading, for the spin operators, instead of to (18.21), to the representation 


(18.65) 
0 0 0 0078 
8, = UJ,0t={0 0 t]; 8, = UJ,Ut | 0 0 ): 
0 4 0, -i 0 0 
0 -i O 
8, = UJ,U* = [t+ 0 | 
0 0 90 


and to a rearrangement of the components of the spin states, so that 
€(R) 
(18.66) 4 = Ux® = (e(R)) = (Rk); n’ = Ux” = e(L); 
€3(2), 
no = Uy2 =n 


and the orthonormality relations 
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(18.67) yh =n yh = 1; yy = ny! = 0; 
An? = ny? = 0 
are now identical with the relations (18.60), (18.61), and (18.62). 


The eigenstates of sg, namely 


nit 0 Ae 
(isles) Ea — fe, EOS ioe eee as 
V2\o 1 V2\ 9 


can be looked upon as a system of three orthonormal vectors which can 
be used to decompose any arbitrary vector f. This is particularly useful 
when one wishes to use the angular momentum representation, leading 
to a description of photon states in terms of so-called vector spherical 
harmonics, which are given in Appendix 3. 

The description of the state vector (18.34) in coordinate representation 
requires a six-component # function 


(18.69) W(x) = (age) x = (q,4), 


where (x) and y"(x) are three-component % functions satisfying 
Schroedinger equations which follow from (18.19) and (18.20) upon 
substitution of the representations P = —iV and H = i(@/dt), and which, 
in the representation (18.65) of the spin matrices, read 

(18.70) —is‘Vy® = i(0/dt) p*; tis Vb" = i(d/dt) p". 

By writing ¥” and ¥” as vectors p* and tp”, each having the components 


1, We, ws, these equations can be written in vector notation, because of 
the representations (18.65), 


(18.71) Wxp® = ip /an); — —Wxep” = i(aep/any 
and can be solved under observance of the transversality conditions 
(18.72) Vk = V-pt = 0, 


The formal resemblance of these equations to Maxwell’s vacuum 
equations for the complex field vectors E+iB and E—7B suggests the 
introduction of the hermitean operators 


(18.73) E(x) = = ta Pa Zi “ [e(S) b(2, 8) e** — eG) bt, 8) etter] 
(18.74) .! 
Vx Bla) = > > > w fi © [e(S) pe, 8) e+ e¥(8) btu, 8) et] 
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where xx = wt—xq and V is a normalization volume, which can be 
derived by differentiations according to the prescription (18.53) from 
the vector potential operator 


(18.75) A(z) = rit 5S Te [e(S) b(2e,.8) e-** + €*(S) b*(e, 8) =]. 
a a 


One has then the decompositions 


(18.76) ele, Vase" e(S) b(xe, 8) = Ja[-B+ a] 
* Ss 


(18.77) 


a > ve Vet eX(8)b'(x,8) = 7| 8+” 5 B)| 


where we is an operator defined by 


etima  gtixa 


(18.78) re ae 


Multiplication of (18.76) with (18.77) from the left and integration 
over dq gives, upon utilization of the normalization conditions on the 
left-hand side, 


(18.79) EY wd*(xe,8) (x, 8) = sf {B°+ B+ (i/V —V>) [E(v xB) 


—(V x B)E}} dq. 


If one could ignore the operator nature of E and B, the term containing 
E(V x B)—(V x B)E would obviously vanish and the desired corres- 
pondence to classical fields is established. In quantum mechanics. the 
presence of these terms is indispensable, however, because without them 
the expectation value of the integral on the right-hand side of (18.79) 
would become infinite for any photon state, including the vacuum state. 
This peculiarity stems from the operator nature of E and B which contain 
both creation and annihilation operators, so that the field intensity 
operators E* and B? each include terms of the form b(x,S)5*(x,S) whose 
vacuum expectation values do not vanish, leading to the infinite result 


(18.80) <0|4 f (E8+B*)dql0> = 4 Do. 


The presence of the term { (i/2V — V) [E(V x B) — (V x B)E}dq is needed 
to precisely compensate this so-called zero-point energy, as @ simple 


xf @ 
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calculation will show. This matter will not be pursued here beyond noting 
that the concept of zero-point energy does not enter the theory of photons 
if one adheres to the description which has led to the adoption of 


LX Ywbhtd as energy operator, because the vacuum expectation value of 
x 5 


this operator is obviously zero, and remains finite for any state containing 
a finite number of photons. 

The conceptual differences between quantum mechanical and classical 
descriptions of electromagnetic polarization phenomena are equally 
radical. Taking the most general case of elliptical polarization as an 
example, one infers, in classical electrodynamics, the possibility of 
determining simultaneously the values of three parameters fixing the 
polarization, for example, the parameters ¢;, és, y, so that at any instant 
of time the electric field vector in the [e(1), e(2)] plane has the components 


(18.81) Ey = e, cos (wt); Ey = e2c0s (wt+y), 


the endpoint of E tracing the polarization ellipse in that plane. 

Tn quantum mechanics no such statement can meaningfully be made 
about the expectation values of the operators (18.73) representing these 
field components, because the operators 6 and b+ have vanishing diagonal 
elements only, and therefore the expectation values of the field operators 
themselves will vanish in any state of definite photon number. From an 
operational point of view, this circumstance is a very satisfactory feature 
of the quantum mechanical formalism, because one cannot talk meaning- 
fully about the measurement of electromagnetic fields without invoking, 
in principle, an apparatus interacting with the field so that emission and 
absorption acts take place making the photon number variable. One can, 
however, conceive measurements of field intensities, i.e. quadratic 
functions of the fields, in stationary situations in which the total number 
of photons present does not change. The parameters ¢), ¢2, y, should 
therefore be set in correspondence with operators containing the creation 
and annihilation operators at least bilinearly. The operators S,, So, Sg 
introduced earlier in (18.45) satisfy just that requirement, and their 
expectation values can be used to describe the state of polarization even 
in case of partially polarized light beams. 

Since the polarization of the photon is a dichotomie variable, giving 
rise to the isomorphism of the abstract polarization operators (18.45) 
with the Pauli matrices, the development of Section 4 can be applied 
and the polarization of a photon beam described in terms if the density 
matrix 


(18.82) M = }(I+Ps) 


where |P| represents the degree of polarization of the photon beam. 
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In particular, the entire information about the polarization of the 
single photon state (18.23), which is a state of definite polarization so 
that |P| = 1, resides in the density matrix 


a(S) a(S) a(S) B*(S) 
Gabe ms) = (Aiyats) Bis)eMS) 
where the numbers «(S) and A(S) are subject to the normalization 
constraint |a|®+|8|* = trace M(S)=1 and are determined up to a 
common arbitrary phase factor. Using for S;, S2, S3, So their representa- 
tions (18.60) in polarization space, one obtains for their expectation 
values 


S, = trace[S, M(S)] = «*B+ap* 
Bo = —i(a* B—of* 
(18.84) elit: aaa) 
Ss = aa* —Bp* 
By = aa* +B. 


These quantities can now meaningfully be set equal to the classical 
Stokes’ parameters 


(18.85) S, = }e,e,cosy; Sy = fe ensiny; Ss = (e223) 


which are quadratic functions of the field amplitudes ¢), ¢2, establishing 
thus the correspondence between the parameters ¢,, ¢2, y and the # 
functions «, B in polarization space. 


Thus far, the discussion has been restricted to transverse photons 
which can be described in terms of a three-component vector potential 
A(q). This treatment has the aesthetic shortcoming of not being mani- 
festly covariant under Lorentz transformations. A little thought shows, 
however, that this is no serious defect, because the electromagnetic 
potentials have the property of “gauge invariance,” permitting reduc- 
tion of the formally relativistic expression 


(18.86) 


AX (x) = we; er, TE [eu(S) b(2e, 8) e** + eh (S) * (xe, 8) et] 


(u = 1,2,3,4) 
to the form (18.75) by a gauge transformation 
(18.87) A, (a) > A),(x) = A,(x) + (0A/Ax,) 


where A is a scalar function, which, for given fourth component of 
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<,(S), is constructed so that Kq. (18.87) is equivalent to the replacement 


(18.88) (8) > ¢,(8) = eulS) — [eq(S)/w] 
so that 
(18.89) 
«{(S) = 0 and ut, = —x=0= eukp for § = 1,2, 


The vanishing rest mass of the photon, i.e. x? = Ky ky = 0? —x® =0 is 
obviously the root of this particular invariance of the transversality 
condition (18.89). 

In presence of sources of the electromagnetic field, the description 
(18.75) acquires another shortcoming far more serious than the aesthetic 
flaw just mentioned, namely it becomes incomplete: transverse photons 
are not sufficient to grasp, for example, the Coulomb field surrounding 
an electric charge. One requires an additional two polarizations, corres- 
ponding to “longitudinal” and “time-like” photons, to accommodate 
the entire range of electromagnetic phenomena. 

The last word has almost certainly not been said about this matter. 
The purpose of these paragraphs is to summarize briefly the formalism 
invented, and widely accepted, for the description of these exotic 
photons, which do not exist as free particles. 

One can easily enough extend the definition of the operator (18.86) 
in a relativistically covariant manner to the case of four independent 
polarizations by simply writing 


(18.90) 
A,(z) = (1/VV) z z (1//2w) Leu(S) b(,S) e** + eff (S) bt (xe, 8) ef] 


where now e¢,(S) are a set of fowr orthogonal unit vectors satisfying 
(18.91) ulS) &ji(S’) = Iso. 


A possible and popular choice of the polarization vectors €,(S) con- 
taining for S = 1, 2 the transverse polarizations as before is 


e(1)x = 0, e4(1) = 0; €(2)% = 0, €4(2) = 0; 


(18.92) ae 


x/w, €4(3) = 0; e(4) = 0, e4(4) = 1; 


allowing one to refer meaningfully to ¢,(3) as the longitudinal and to 
¢,(4) as the time-like polarization vector. With this choice the polariza- 
tion vectors satisfy the formally covariant relation 


4 


(18.93) = e.(S) e¥(S) = CHa 
1 
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However, the choice (18.92) is neither necessary, nor is it necessarily 
the most convenient. On grounds of relativistic covariance alone it is 
sufficient to require instead of (18.93) 


(18.94) = €,08) €f (3) = Byy t+ ky ky S(e?) 


with arbitrary f(«®). In fact, the various choices of «,,(S), belonging to 
different f(x®), are equivalent to different gauges adopted for the elec- 
tromagnetic potentials, and keeping the function f(«*) open in all 
calculations is a convenient way of making the gauge invariance of the 
description manifest. 

To see this, consider a gauge transformation (18.87) where A(x) is now 
the most general scalar function linear in the A,,(z), 


(18.95) A(x) = —F(—[) (@4,/@z,). 


Here F is an arbitrary function of the scalar operator [_] = 0°/0x, 0x, 
and the signs have been chosen purely for convenience. With 4,,(x) 
given by (18.90), one finds explicitly 


(18.96) A(x) = (VV) x [F (x?)/-V 20] [xc, €,(8) B(x, 8) 


—K, €F(S) b* (x, 8) 7] 
so that 
(18.97) Aj(x) = (1/V/V) z = (1/V2w) {Le.(8) 


+ F(ie2) Ky Ky €)(S)] D(oe, 8) e-** + 0.0.}. 


The gauge transformation generated by (18.96) is thus equivalent to 
replacing the polarization vectors by 


(18.98) €(S) > ¢,(S) = €,(8) + F(«*) Ky Ky €y(S) 

giving 

(18.99) 3 €(8) €*(S) = By + ky KyL2F (Ke?) +? F(x), 
s=1 


which is identical with (18.94) if one drops the primes on the left and sets 
(18,100) f(«2) = 28 (x?) +x? F?(«?). 


For many applications it is more convenient to introduce a scalar 
function d,(«”) by 


(18.101) fle?) = (— Ve? 
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which splits (18.94) into two terms, namely 


4 
(18,102) 3 ¢,(5) €f(S) = (4.- 4%) +d,"0%, 
S=1 K K 
The first term can be called the “transverse part” in a four dimensional 
sense, because it satisfies 


4 tr 
(18.103) Ky (3, e,(S) <1(8)) = 0. 


The arbitrariness in the gauge of the vector potentials lodges now in the 
arbitrariness of choice for the factor d,(«®). If one can show that observ- 
able effects do not depend on d,, one has a “manifestly gauge-invariant ” 
theory. 

The conventions (18.92) leading to (18.93) are obviously equivalent to 
putting d, = 1. An alternative choice, which is at least as convenient, is 
d,=0. For many applications it is advisable, however, to carry the 
unspecified factor d, in all calculations, thus retaining and exhibiting 
freedom of gauge, and to settle for a particular value of d, only when this 
results in an overwhelming computational advantage. 


NOTES 


Jordan and Klein [1] introduced the concept of creation and annihi- 
lation operators for bosons. See also Dirac [2]. 

Jauch and Rohrlich [3] treat transverse photon polarization and give 
complete references to earlier work on this subject. 

Archibald [4] noticed that one can write Maxwell’s vacuum equations 
as Schroedinger equations for two 3-component y functions. See also 
Akhiezer and Berestetskii [5]. 

Landau and Peierls [6] gave the decomposition of photon annihila- 
tion and creation operators in terms of electromagnetic field operators. 

Rose [7] gives the unitary transformation (18.64). 

For a discussion of some of the perplexities introduced through 
longitudinal and time-like photons seo Kallén [8]. 

Bogoliubov and Shirkov [9] use consistently a gauge convention in 
which the function d, is left arbitrary. 
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_ SECTION 19 


Electrons and Positrons 


Particles having a rest mass m, so that the relation between energy {2 
and momentum k is 


(19.1) Q? =k? 4m? 


need not have their spin aligned either parallel or antiparallel to the 
direction of k. Even if in a given coordinate frame spin and momentum 
happen to be aligned, this alignment will be destroyed by a Lorentz 
transformation if the particle travels at a speed less than the speed of 
light, because spin, being a skew tensor, and momentum, being a vector, 
transform differently under such transformations. 

Electrons and positrons, in particular, are particles belonging to spin 
j= 4% which differ from neutrinos and antineutrinos in that their spin 
does not exhibit the firm correlation with direction of momentum k, 
which makes the y function of massless particles obey either an equation 
of the form [H —(o-P)]u = 0 if they are right-handed, or [H +(o-P)]v = 0 
if they are left-handed, as was explained in detail in Section 17. One 
cannot include the effect of a rest mass by simply adding a term mu or mv 
to the equations for wu and v respectively, because if m is a scalar the 
resulting equations would not be invariant under inversions J7. A way 
of accommodating the mass without violating any invariance require- 
ments, including invariance under Lorentz transformations, was 
invented by Dirac, who showed one can describe electrons by four- 
component y functions obtained by coupling the two two-component 


functions w and v through the mass term, 
(19.2) (H—oP)u = mv 
(H+oP)v = mu. 


In the limit m +0 they describe uncoupled right- and left-handed mass- 

less particles of spin j = 4, but for m # 0 these first order equations are 

equivalent, by iteration, to the second order two-component equations 
(A +oP)(H—oP)u = m?u 


(H—oP)(H+oP)v = mv. 
161 


(19.3) 
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For free particles these equations are equivalent, because of the C.R.s of 
the a, to 


(19.4) (H?—P?—m?*)u = 0; (H?—P?—m?)v = 0. 
Introducing the 4 x 4 matrices 

0 -o 01 
(19.5) v=(o “ts m=( oO) 
and the four-component } function 

w 

19. = 
(19.6) v=(') 


one can, because of 


wr h(a oC) 


write Eqs. (19.2) in the compact form 


(19.8) (ye H—yP) yh = mp 

This equation, due to Dirac, can alternatively be written 
(19.9) (aP+Bm)% = Hy 

with 

(19.10) a=yy;s B=Y 


The matrices (y,7s) =y, introduced here are identical with the ones 
used by Feynman. They differ from the y's used in most conventional 
texts by a unitary transformation. They satisfy the anti-C.R.s 


-1 
(19.11) Y%v+%rYp = Buy with Suv = 
+1 


A very important matrix in this abstract four-dimensional space is 


a 0 
(19.12) Ys = V1V2¥8V4 = ( a4 
which satisfies 
(19.13) ye=—-1 and ysy,+ry¥s = 9 


The operators 


(19.14) Or = HU +ins) = (5 1) 
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and 
2 ¢ Kea} 
(19.15) Can ea i 4 


obviously project the components u and v out of +, respectively, 


(19.16) Cn (‘) =n; Onp = (') = 


Accordingly, the operator 
: 1 0 
(19.17) C= -y= if Le 


can be called the “chirality” or “handedness” operator, because yp 
and ys, are eigenstates of C with eigenvalues +1 and —1, respectively, 


(19.18) Obn = +4r; Ch, = —pr 
In terms of the y matrices, the spin operators can be represented as 


ao, 0 


(19.19) on (5 2.) = Wes (cyclically). 


Since 6 and (y,H—yP) do not commute, the solutions of the Dirac 
equation (19.8) are not, in general, eigenstates of ¢ for arbitrary direction 
of the spin. Only if the particle is at rest, namely when Py = 0 so that 


4 Av = mw 
(19.20) Vs Abrest = Mrs, 18 { ine | 
which requires 
. uU 
(19.21) wav ie. prose = (‘) 


can ¢ be an eigenstate of 6. By convention, the representation 


(19.22) Gs rest = + rest 


will always be chosen. 

To be quite explicit, the state vectors (})¢; (?)¢ spanning the chirality 
space defined by the eigenvalues + 1 of C, and the statevectors (})s; (?)s 
spanning the spin space defined by the eigenvalues +1 of og in the rest 
frame of the particle can be introduced. A general electron state can then 
be characterized by the quantum numbers k, 8, C, where C stands for 
the dichotomic label R, L of chirality (right-handedness, left-handedness), 
and § stands for the dichotomic label ¢ , | , of spin in z direction (spin up, 
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spin down, in the rest frame), and such a state is generated from vacuum 
|0> by the appropriate creation operator, 


(19.23) 
|k,S,C> = at(k, 8, C)|0> 
where 
|k,S, 0) = |k,S, R) x (6), Iis8.L> x (i), 
092) = het. Rx(!) (2) 4 4.2% (°) (2), 


sts rtya() ai) atito=() 


In the coordinate representation 


(19.25) |k,9,C> = [ |a>day,s,0(a) 


the ¥ functions characterizing |k,S,R>; |k,S,L>; |k,t,R>; |k,},R>; 
|k, t,.L>; |k,|,Z>; are w; v; v1; v2; 01; v2; respectively. 

In the combined spin-chirality space (which has four dimensions), the 
operator of inversion of coordinates can be represented by (0 and 1 
stand for 2 x 2 matrices in spin space) 


S 01 
(19.26) ip i 0) =i 
because chirality C = —iy; must be odd under inversions, whereas the 


spin 6; = (y,y, (cyclically) must be even, and this is guaranteed by (19.26) 
because of the anti-C.R.s (19.11) and (19.13). One expects, therefore, 
the operator of coordinate inversion to be entirely represented by 


on 
(19.27) TT = n7ysHp = tae 0?) an 


where IT, acts on the dynamical variables such as H and P and represents 
the transformation q — —q in the y) functions, and y77 is a phase factor, 
which in accordance with the conventions adopted in Section 14 will be 
chosen so that #7 = +1. 

The operator of time reversal, on the other hand, should leave the 
chirality unchanged and must therefore be diagonal in chirality space. 
This is accomplished by putting 


(19.28) 6=TK = I nk 
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where 7’ is in spin space according to (15.42) of the form 
Lege al (0 -7@ ° 
(19.29) T = (ca 0) = i(; * = tog 


and K is the operator of complex conjugation, thus guaranteeing the 
correct transformation properties of the spin operator 6, namely 


066-1 = —6. In terms of the matrices y the operator 7 may be written 
ou _ [iog 0 
(19.30) t-rn=(S 2) 


so that altogether the operator of time reversal can be represented by 
(19.31) 9 = neviysxTpK 


where 7’, acts on the dynamical variables and represents the transfor- 

mation ¢ > —t in any time-dependent function, and y@ is an as yet 

undetermined phase factor, subject only to the condition yg 76 = 1. 
Eigensolutions of (19.8) will now be sought which satisfy 


(19.32) 
A = Qf and Py = kh with Q= +VR +m. 
It is understood that the energy Q is never a negative number. Using 


P = —1(0/0Q) one has in coordinate representation 
(19.38) (ys Q-+iy[8/Aq]) b = mp. 
By writing 
Ay 
(19.34) x,s,o(4) = A(k,S, 0) et = rn Cie! 
Ay 
one obtains four linear homogeneous equations for the A,,(k,...), namely 
(19.35) (ygQ—yk)A = mA 


which read explicitly, with representation (19.5), 

(Q+ks) Ag+ (ky—tke) A, = mA, 
(Q—kg) Ag+ (ky +key) As = mAy 
(Q—ks) A, —(ky —tky) Ay = mAs 
(Q+ks)Ag—(ky +tkg) A, = mA, 


The necessary and sufficient condition for existence of a nontrivial 
solution is the vanishing of the coefficient determinant 


(19.36) 
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(19.37) 
—m 0 (Q+ks) (ky —tke) 
0 =m (ky +ike) (Q—Ks) 
Oak. ey ae eee ee 
— (ky +k.) (Q+ks) 0 —m 


which is obviously true because of (19.1). 
For given energy 2 = + Vk®+m* > 0, Eq. (19.34) has two linearly 
independent solutions, namely 


(19.38) 
m (ky — tke) 
Vk, VQ—hy 
AQ ee 
O = ara a7ase | pene A ape 
— (ky +k) m 
O=hi QE, 


which have been normalized so that 

(19.39) A*(1)A(1) = A*(2)A(2) =1/V ite. [ vrddq =I) 
v 

and which satisfy the orthogonality relation 

(19.40) A*(1) A(2) = A*(2) A(1) = 0. 


The phase conventions have been arranged so that in the rest frame 
(k = 0) the states 


(19.41) 
1 0 
ay ie Dasa fhd 
A(1;k = 0) = —= and A(2;k = 0) = —— 
. )= av 1 = av \ 0 
0 - 
result, which are eigenstates of 6, with eigenvalues +1 and —1, 
respectively. 


Formally, Eqs. (19.36) have another pair of solutions if the energy 
is taken to be EH = —Q= — Vk? +m? < 0, so that there are altogether 
four linearly independent solutions, labeled A,(r) with r=1, 2, 3, 4, 
which are summarized in Table 19.1. They are normalized so that 
(19.42) 

A*(r) A,(r’) = (1/V) br (sum over Greek subscripts appearing twice) 
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In analogy to the development of the theory of photons in Section 18, 
resulting in the possibility of representing all observables as operators 
bilinear in creation and annihilation operators, it is often convenient to 


TABLE 19.1 


Tue Four Liyzarty InpEPENnpeEnT So.urions or Eq. (19.36) 


(Q = +V ik? +m?) 


1 —— Be = 
Via Vien oo 7* : 
ky t+ike —m 
2 OD VEEL esi dae ' 
x 
m ky — tke V2VQ2 


3 VQ—ks 0 


—(kitike) —_m 


VQ—ks VQ=ks 


work with space and time dependent Dirac field operators, which may 
be constructed in terms of the A,(r) as 


(19.48) p(w) = aaa > A(r,k) a(r, k) + ef(kat2o > A(r,k) a(r, x)| 
r=1 r=8 

where a(r,k) = a(k,r) is the annihilation operator of an electron with 

positive energy if r=1, 2, and of an electron with negative energy if 

r= 3, 4. In terms of this operator and its adjoint (x) = s*(x)y4, the 

energy operator for any many electron state takes the form 


(19.44) H = J €(@)dq = f B(-tyV +m) pdq 


aS = al 3 S at(r,k)a(r, k)— Ey at(r, (kav). 
r=) 

Not surprisingly, this expression is not positive definite, because 

solutions of negative energy have been admitted into the theory. This 

difficulty of interpretation is compounded when one considers the 

operator of electric current density 


(19.45) jue) = ebyy 
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which owes its definition to the observation that it satisfies a conserva- 
tion law 


(19.46) . [0j,,(")/Ox,,] = 0 
as a consequence of the Dirac equation and its adjoint, 
(19.47) —i(8/Ax,)y,p+mp = 0; — i(0/Ox,) Py, +mp = 0. 
If one writes down the electric charge operator 
4 
(19.48) Q =e f jalz)dq = eS Y at(r,k)a(r,k) 
k r=1 


one has now an expression which is positive definite as it stands. 

Wanted, of course, is just the opposite, namely a positive definite 
energy density and an electric charge which may have negative as well 
as positive expectation values. 

One famous way out of this dilemma is the hole theory of positrons by 
Dirac, who essentially availed himself of the possibility of performing a 
unitary transformation interchanging full states and holes as far as 
states of negative energy are concerned, in accordance with the procedure 
explained in Section 17, With the sign conventions implied by Kq. 
(17.25), this transformation amounts to the introduction of new operators 


a*(2, —k) = —at(3,k) replacing a(3, k) 
at(1, —k) = at(4,k) replacing a(4,k) 
so that, with the notation 
a,(r,k) = a(r,k) for r= 1,2 
Ax(r,k) = A(r,k) for f= 1,2 
A_(2, —k) = —A(3,k) 
A_(1, —k) = A(4,k) 
the operator (19.43) may be written 
(19.61) 
We) = SE [e-29 A (7, ) ay(r, ie) + ete Ar, —k)a(r, —k)] 


(19.49) 


(19.50) 


* 
& 
Mi 


=> 3 [etea-20) 4 | (rk) a4. (r7, k) + et _4_ (rk) at (r, k)]. 
k 


The subscripts (+) and (—) have been affixed to the operators a and the 
amplitudes A in anticipation of an interpretation associating the sub- 
script (+) with lepton number L = +1 and the subscript (—) with 
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lepton number L = —1. The amplitudes A;(r,k) according to (19.50) 
are collected in Table 19.2. For A;,(r,k) = AZ(r,k)y, one has the corres- 
ponding Table 19.3. 
TABLE 19.2 
Tue Ampuirupes A ;(r,k) 


(Q = +Vk2+m2) 


R 
= 
_ 
to 
_ 
i 


m ky —tke 
Ln | foam ines ® Viaths 
—m ki+itke 
S MOSS po TGs ; 
x 
—(ki—tka) —m V2VQ 


3 VQ—ks 0 


—(kit+ike)  m 


a ke 0 
Ol ares goers ee 
eel | hen 
TABLE 19.3 


Tue Ampuirupes A ,(r,k) = AZ(7,k) ya 
(Q = +Vk2+m2) 


1 2 1 2 
a — (kei + tke) -—m 
Vl=k <0 Vath, VO+ks 
—(k1—tka) om ae 
WoSnoshu : 1 
m_—_—_ (ka-+ike) bs OVERE “a2 
VQ—ks VQ—ks ed 
—m (ki —tke) 
° VO-b Tan VOI 
pa ] Tate iI 


From these tables one can construct immediately the matrices, needed 
later, 
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(19.52) > [4+(k,r)], [44(k, r)]p = 


a 0 (ue Ay 
0 RG OE 
Ma Seay a 9 |< GRYa) 
— (ky +tke) Q+ks 0 m 


and 


(19.53) © [4-(k, 7), [4—(k,r)]p = 


pe nt Gay aires 
0 i (haere 

Guay ein ME eet 0 ARIEMES), 

ach ate 0 a 


which in terms of y matrices may be expressed as 
(19.52) 5 [44 (k,r)], Ay (ks, r)p = (1/272) fy, Q—ky + mI leg 


or 


= Ask, 7) A,(k,r) = (1/2V.Q) (+m) 
(19.53) x [A_(k, r)]g [4-(k, 1), = (1/202) [y4@—ky —mI],g 
or 
& A_(k, 7) A_(k, 7) = (1/2V.Q) (k—m) 
; where k= ky yy, = ys Q—ky. 
With this relabeling, the energy operator takes the form 


(19.54) H=% S QL yk, r) + N_(k,r)—2] 
r=1 

where 

(19.55) Ni(k,r) = ah(k,r)a,.(k,r) 


are now interpreted as the number operators of electrons and positrons, 
respectively. The expression (19.54) is obviously invariant under inter- 
changes k«+ —k and/or r = 1,2«+r = 2,1. The conventions inherent in 
the notations (19.50) are such, however, that any positron state labeled 
(k,r) differs mechanically from the corresponding electron state labeled 
(k,r) only by its handedness. 

In this hole theory of positrons, the infinite vacuum energy — 25,2 
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is disregarded since it is the same for all states and thus, in principle, not 
observable. A modified definition of the electric current density which 
eliminates from the outset any nonvanishing vacuum charge, without 
invalidating the conservation law (19.46), is 


(19.56) jul) = (¢/2) bs yu] = (¢/2) Dy lep [oan bp —¥p Foe] 
giving in particular for the total electric charge 


(19.67) @ = eB ¥ Wyler) Nn] 


which is consistent with the interpretation of NV, and N_ as number 
operators for electrons and positrons, respectively.* 

It is instructive and useful for later applications to write out the 
operator (19.56) for the current density in terms of the operators 
ai,(k,r) and a.(k,r); using the anti-C.R.s of these operators one finds 


(19.56) 
(/2)%. 7.4] =e Xd XE fexp(—s[(k’ —k) q—(2'—2)¢)) 


Kk 
x A,(k',r’) y, A4(K, 7) at (k’, 7’) a,(k, 7) 

—exp (i[(k’ —k) q—(2'—Q) t]) A_(k’,r’) y, 

x A_(k,r) at(k,r) a_(k’,r’) 

+exp (—i[(k’ +k) q—(Q' +) t]) Ay(k’,r’) y, 

x A_(k, r) at(k’, r’) a*(k, r) 

—exp (il(k’ +k) q- (@’ +.2)t)) _k’,r')y, 

x Ay (I, 7) a4(k,r) (hk, r’)} 

=¢ ~ x [A,(k, 7) Y,A+(k, 1) — A_(k, r) Yp 4-(k, r)]. 


The last term, not containing any operators a or a*, vanishes for all p, 
as can be seen by straightforward computation from Table 19.3, and 
(19.57) follows as a special case for p = 4. 

The ingenious hole theory, necessitated by the admittance of negative 
energy solutions into the theory, can be avoided altogether from the 
outset, if one exploits a peculiar symmetry property of the Dirac 
equation which is known in the literature as charge-conjugation sym- 
metry. By this is meant that if one interchanges the number operators 
N4 and N_ 


(19.58) N,2N_ 


*If one uses instead of (19.42) a relativistically covariant normalization to Q/m 
particles per unit volume, then, in the definition (19.51), a factor Vm/Q is required. 
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then all mechanical operators, such as P,,, are not affected, whereas the 
operator j,,, with the understanding that e is a numerically fixed parameter, 
changes sign. This property is already manifestly incorporated in 
expressions (19.54) and (19.56). 

By a curious and rather remarkable property of the Dirac equation 
and its solutions, not shared, for example, by the equation governing 
the neutrino, the effect of (19.58) can be represented by an operator Im 
which affects only the spin-chirality space, so that one may write 
instead of (19.58) 


(19.58) pow = orl = apply foe = aT = ny 
where J’ does not affect the creation and annihilation operators, and 
where np is an as yet undetermined phase factor subject only to the 
condition npyf = 1. A representation of Fin terms of the matrices Ye 
may be obtained by requiring y’ to satisfy the Dirac equation 

(19.59) [—ty,(0/0x,) +m] p =0 


provided y and # satisfy the same equation, i.e. (19.47), respectively. It 
turns out that this condition is sufficient to ensure the invariance of P,, 
and the change in sign in j,,. Substituting in (19.59) for #’ one finds 


(19.60) Pl-ify, P(af/ax,) +m] = 0. 
This is a consequence of the second equation (19.47) provided 
(19.61) 

po Ml = —Yps in components iy, Tle = —[yp]ee 
With the representation (19.5) one has 


_f tlle for p= 2,4 
(19.62) [Yplap = { [le for w= 1,3. 
One can thus satisfy (19.61) by choosing 
Dineed) lO) 0 
Ftemsils. ae tte 08 100)\ eee Lad Ore 0} aad 
(19.63) T= tyovs - i( 0 a) 6.01.0 ll 
Le § 0 
so that 
(19.64) haa deo) hee 


and I’ becomes unitary, 
(19.65) fas ff,” ie. Trt aT 
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The proof that j,, changes sign under the operation yy’ is now straight- 
forward, Using (19.64) and (19.61) one finds 


(6/2) [Fs 741 = (6/2) Ey laplPa oe — Up Val 
~ (6/2) [yp apll "Vay P aly Fs — Fo by] 
(6/2) [P "yal 7plapl pal ty —ty Bs] 

— (¢/2) [yplsylPs Py —Yy Ps] = —Jp- 


It should be stressed that if one wishes to replace the arrow in Eq. 
(19.58), i.e. if one wants to formally define an operator I’ connecting 4’ 
with by 


(19.67) w= Dp = ap lh = np lt ys = apy. Ky 


then this operator must be antiunitary, 


Iu 


(19.66) 


(19.68) T= aplygK = npiyek. 
Since y, is pure imaginary and y) 2 = —I, and because ny} = 1, one has 
(19.69) I? = iysKiy,K = —yoy2 = +1 


and no superselection rule is generated in the electron-positron system by 
pure charge-conjugation symmetry. However, if one considers now the 
operation of combined inversion, using (19.27), 


(19.70) 2 = MD = gg arysp iy, K 
one notices that 2 too is antiunitary, and has the additional property* 


(19.71) 2? = tysyKiysyeK = —ysyoyaye = vayt = —I. 


Thus the spaces spanned by the state vector |...Ny(k,r)...> and its spatial 
inverse Z|...Nz(k,r)...> are separated by a superselection rule, which in 
view of the fact that I interchanges L = —L may be interpreted as the reason 
for the conservation of lepton number L as far as electrons and positrons are 
concerned. 

One may thus from the outset circumvent the introduction of negative 
energy states entirely, by considering only positive energy solutions of 
the Dirac equation and defining positron solutions as obtained by the 
operation of combined inversion Y from corresponding electron solu- 
tions. The consistency of this procedure is borne out if one computes with 


* Note that this would be true even if one had chosen the convention nh = —1, because 
nr enters 2? only in the combination y7 7h. 
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the representation (19.70) the effect of 2 on the first two columns of 
Table 19.2, namely 


(19.72) 2As(k,r) = an 4a(-k,n). 


The effect of combined inversion is thus a change in sign of the momen- 
tum k and the replacement of lepton number L = +1 by lepton number 
L = —1, The operator (19.51), in particular, may be written 


(19.73) le) = BE + 2) Ay (4 (7) 
r=1 


with the understanding that the operator of combined inversion applied 
to an annihilation operator has the effect 


(19.74) Za41(k,7) 2? = gn grat(—k,r). 


These considerations strengthen the point of view, already expressed 
on the occasion of the corresponding development for neutrino and anti- 
neutrino, that an antilepton should be considered as the spatial inverse 
of the lepton, and not as its time reverse as had been suggested by 
Feynman. 

The operation of time reversal, when applied to electrons, is also an 
antiunitary operation, satisfying, since y; and ys; are real and satisfy 


vi=ve = —L, 
(19.75) 0? = y1y3KyiysK = yiysviys = —vevi = —1 


so that time-reversed states of electron states are also separated from 
electron states by a superselection rule. One is tempted to speculatively 
identify such states with the corresponding muon states, so that conser- 
vation of muon number L,,, already mentioned in Section 17, flows from 
that particular superselection rule. Such speculation intensifies the riddle 
posed by the muon’s mass which, except for the number Lys seems to be 
the only attribute by which a muon can be distinguished dynamically 
from an electron. Since at present there exists no satisfactory dynamical 
theory of the masses of elementary particles, one cannot dismiss the 
possibility that in the actual physical world the masses of electron and 
muon are connected by the transformation property m, =@Om,@-. 
Dirac’s equation in four-component form (19.8) looks deceptively 
simple. The wealth of information contained in it is brought out more 
transparently when one attempts to write the equation in two-component 
form. To this end, consider again Eq. (19.2) and remember that, according 
to (19.21), the two two-component functions w and v are equal in the 
rest frame, i.e. whenever the eigenvalue of P vanishes. Knowledge of the 
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two-component % function w is thus sufficient in the rest frame, and it 
ought to be possible to find a two-component w, say, from which all 
information contained in the four-component i can be extracted. 

To be precise, one wants to find a two-component y function w, 
taking the place of the four-component function y, and 2x 2 matrices 
representing any operator F’, taking the place of any given 4 x 4 matrix 
representing an operator /, so that the expectation value of F in the 
state w and of F in the state ys are equal, 


(19.76) CHIP > = <wlF|w> 
provided the normalizations 

(19.77) <p> = 1 
and 

(19.78) <w|w> = 1 


are adopted. Strict adherence to the condition (19.78) is decisive for the 
consistency of the procedure. 

To actually carry out the transition from a four-component to a two- 
component description, remember that finite momentum P means v and 
w are not equal and write 


(19.79) v = (I+W)u 


where W is a 2x 2 operator that can, in principle, be derived from Eq. 
(19.2), as will be shown by successive approximations below. The desired 
two-component function w likewise must be obtainable by some opera- 
tion G, say, applied to u, 


(19.80) w= Gu; u=G@ w. 


The connection between G and W is culled immediately from the nor- 
malization conditions (19.77) and (19.78), which read 


(19.81) <p|p> = <uluy+<olu> = <ul + (1+ Wt) (1+ W)\u> =I 
and 

(19.82) <wlwy = <ulG*Glu> = 1. 

This can be true only if, up to some phase, 


(19.83) GG = 1+(1+W*) (1+). 
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Now the most general 4 x 4 operator is of the form 


(19.84) f= Pu Pu (Fy, are 2 x 2 matrices) 
Poy Foo, : 


so that its expectation value can be written in terms of two-component 
functions 


(19.85) 
bP | P> = <ul Prs|u> + (ul r2|v> + <o|Far|u> + <e| Feel) 
= (ulPy, +P yo(I + W) + (1+ Wt) Poi + (1+ W*) Foo(I + W)|u> 
= (w|(4*) [Fit F iol + W)+ (1+ W*) Fey 
+(I+ W*) Pool + W)1G|w». 


The condition (19.76) is therefore satisfied provided one uses as 2 x 2 
operator F the expression 
(19.86) 
Fes (Pit Fil + W)+ (0+ W*) Por + (0+ W*) Pool + WG. 

The remaining task is to find expressions for G and W from Eq. (19.2). 
By substitution of (19.79) one finds the operator equations 

H-oP = (I+W)m 
(19.87) 
(H+oP)(I+W) = mI 


so that one has simply 


(19.88) I+W =1+W*t = (1/m)(H-oP) 
and therefore by (19.83) 
(19.89) G@ = -i(I+W))+i0+ W)). 


Thus, up to some arbitrary phase, 
(19.90)  @ = I+i(1+W) = V2e"/ (7+ (1/2) th W] 
(19.91) @t = I-i(1+W) = V2e% [14 (1/2) e W)). 


The expressions for G~! and (@*)~! needed in (19.86) can now be 
obtained as a series in powers of W. Keeping terms up to order P*/m’ one 
has the approximations 


(19.92) H = Vm? +P? = m+(P2/2m)+... 
(19.93) W = (1/m)(H—oP—m) = —(oP/m)+(P?/2m?)+... 
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(19.94) W? = (P?/m*) —(P2oP/m?) +... 
(19.95) W* = —(P*oP/m)+... 
and from these follow 
(19.96) 
G@-? = [(1—1)/2]1—(1/2) W+[(1+4)/4] W2-(i/4) W3 +... 
= [(1—#)/2] 1+ (oP/2m) +7(P?/4m*) —(P? oP/4m*) +... 
(19.97) 
(G*)-? = [(1+4)/2]1 + (6P/2m) —i(P?/4m?2) — (P2 oP/4m) +... 
Collecting terms one finds for (19.86) 


(19.98) 
F = (1/2) (Py, +Fi2+Fo1+F 29) 
+ (1/4m) {[(P11 —P 22) oP + oP(F' 1, —F'22)] 
+7[(P 41 +P 20) oP—oP(F 1, + F22)] 
+[(P2i— Fiz) 6P —oP(F'2, —F y2)] 
+i[(Po1 + Fy) oP —oP(F2; + Fy2)]} 
+ (1/8m*) {—[(Fi1+Foo)P?+P2(F 11+ F22)] 
+1[(Fi1 —F oo) P? —P?(F 1, —F 22)] 
—[(Pe1+Fi2z)P?+P?(Fe1+Fi2)] 
+1[(Po1 —Fy2) P? +P*(Fo:—F 12)] 
+ 2[oP(F1,+F 22) oP —ioP(F,; —F >) oP]} 
+ (1/8m*){—[(Fi1— Foe) P? oP +P? oP(F 1, —F 22)] 
—i[(Fi1 + Poe) P? oP —P* oP(F's1+F'29)] 
+[(P21 —F'o1) P? oP —P* oP(F12—F'21)] 
—i[(Py2+F21)P? oP —P? oP(F 12+ Fe1)] 
—[oP(Fy2—F21) P? —P*(F12—F 2) oP] 
+i[oP(P 11+ P22) P?—P*(F 1, +F 22) oP} 
+terms of order (P*/m*) and higher. 
This formula ought to be sufficient for any practical purposes. It is valid, 
incidentally, even if P is the operator of momentum of a Dirac particle 
in a fixed external field, to be explained in Section 20. 
As a simple example the matrices y are given in two-component form 


in Table 19.4 for the field free case and under expansion up to and in- 
cluding the power P?/m?. 
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TABLE 19.4 


REPRESENTATION OF THE Matrices y IN Two-Component Form, uP To TERMS OF 
OrpER P2/m® InciusiIvE 


(It will be noticed that the operators a can be interpreted as representing the vector 

of kinematic velocity. This completes the emancipation of the momentum from the 

velocity, which goes like a red thread through the development of classical mechanics 
in the 19th century.) 


> 
eI 
i=} 
by 
8 


Fig Fa F 


Y 0 0 -e @  (1/2m)[o(6P)—(6P)o] = —(i/2m)(ox P) 
ya 0 0 1 1 (1L=(P2/2m2)] 7 
a= yy oe -G 9 0 — (1/2m)[(oP) + o(GP)] = (P/m)I 
ys 1 -1 0 0 (oP)/m 
NOTES 


Dirac [1] found the equation which bears his name. 

The conventions used regarding Dirac’s equation agree with the ones 
given by Feynman [2]. 

Dirac [3] proposed the hole theory of positively charged electrons to 
circumvent difficulties arising from the states of negative energy, which 
had been examined earlier by Oppenheimer [4]. 

Charge conjugation as a symmetry operation was apparently first 
proposed by Kramers [5]. 

Becker [6] gave the first correct reduction of Dirac’s equation to two- 
component form, See also Chraplyvy [7]. 

Earlier work by various authors following a procedure by Darwin [8] 
is incorrect, because the two-component ys functions used by Darwin 
are not properly normalized, giving rise to nonhermitean terms in the 
two-component Hamiltonian. 
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_——_— SECTION 20 


The Lack of Sufficient Reason for Actually Existing 
Interactions 


The apparent capriciousness of nature which provides just four sup- 
posedly basic interactions of widely disparate strengths is matched per- 
haps only by the equally enigmatic apparent arbitrariness exhibited in 
the mass spectrum of the so-called elementary particles. 

The various classical responses to the baffling mystery of interaction 
have resulted in tenable edifices, such as Kinstein’s theory of gravitation, 
and in failures, such as Hinstein’s unified field theory. This type of 
approach to the problem of interaction has now generally been aban- 
doned, partly because gravitation and electromagnetism turned out to 
comprise only a fraction of observable interaction phenomena. 

The experimental exploration of the so-called strong and weak inter- 
actions, which appear empirically to be as disconnected as are gravi- 
tation and electromagnetism, has gathered evidence on a vast scale since 
the advent of quantum mechanics, lending urgency to all those attempts 
which cast among the tenets of quantum mechanics for a vehicle to which 
a theory of interactions might be attached. 

The arbitrariness in the phase of a state vector, in particular, has been 
a favorite starting point for efforts aimed at deriving the specific form of 
actually existing interactions from invariance arguments. A typical line 
of reasoning runs as follows. 

Consider a single particle state |1 = a+|0> characterized by a set of 
functions ¢,(q) so that 


(20.1) W, = | #4(a)¢,(a)dq = 1 

and 

(20.2) Ray e%w’ = f $8(q)¢,(4)dq = complex number 
so that 


(20.3) P,,. = R2, = Probability for finding the values r’ of the observ- 
ables if the particle is known to be in the state 
characterized by the values r. 
169 
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In terms of the eigenstates of momentum k, for example, and their 
y functions y,(q), any such state can be obtained by a linear superposition 


(20.4) Pr(Q) = Bi Cur) Hula) 


and one can write in terms of the expansion coefficients 


(20.5) W,= x leu(r)|?5 Rpg ef¥e = x C(t") (7) 


Now the following question arises: If one writes 
(20.6) $,(q) = 4,(q)e; —4,(q) and a,(q) real functions, 


to what extent is the phase «,(q) determined by the two quantities which 
have an observable meaning, namely W, and R,,,? 

Clearly, W, is independent of «,(q), and the phase remains undeter- 
mined by W,. The only requirement imposed by given R_, is that the 
integral (20.2) must have a definite modulus. Consequently, the inte- 
grand, although it need not have a definite phase at each point, must have 
a definite phase difference between any two points in space, whether 
neighboring or not. This follows, by generalization, from the simple rules 
governing the addition of complex numbers. Suppose one wants to have, 
in the sum R(cosy+isiny) = R,(cosy;+isiny;) + R2(cosys+isinys), 
only R determined, but to allow y to remain arbitrary. Using the elemen- 
tary formula R? = Ri +R3+2R, R,cos(y,—ys) one sees that, for given 
R, only the phase difference y, —y2 is determined. 

Thus, the change in phase of 4*(q)¢,(q) along a closed curve must 
vanish, This requires then that the change in phase of $,(q) along a 
closed curve shall be opposite and equal to that in 4*(q) and hence the 
same in all $,(q). Result: 

The change in phase of a yb function along a closed curve must be the same 
for all functions, independent of r. 

In other words, the change in phase along a closed curve must be 
something determined by the dynamical system itself, independent of 
the particular state considered. This suggests exploiting the nonintegra- 
bility of phase to accommodate features of the environment, such as 
provided by some external field in which the particle moves. 

To investigate this possibility write, generalizing to dependence in 
space and time, 


eae @, (x) = Bo(ar) ef 


where ®%(2) is an “ordinary” ; function, i.e. one with a definite phase at 
each point « = (q,t) in space-time, and the indeterminacy in the phase 
is put into the factor ¢®, Tt will be noted from the foregoing that A(x) 
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is not required to be a function of x having definite values at each point, 
but B(a) must have definite derivatives, 


(20.8) Ky = 0B/ax, 


at each point, which do not, in general, satisfy the integrability condition 
Ax,/ Ox, = Ox,/Ax,. Now the change in phase, around a closed curve, 
should be observable because R,, depends on it. In four dimensions 
this change in phase is, by Stokes’ theorem, 


(20.9) § kya, = J J [(Ar,/82,) — (A, /22,)] dS, 


where dS,,, is the skew tensor element of the surface bounded by the curve. 
It is now very tempting to identify the derivatives of the phase B with 
the electromagnetic potentials, so that 


(20.10) x = —eA, 
and 
(20.11) (0x,/0x,) —(Ax,/Ax,) = —eF ,, 


can be identified with the electromagnetic field tensor, if e stands for the 
numerical value of the electric charge of the particle under consideration. 
The homogeneous Maxwell equations 


(20.12) €\u(OF yy/) = 0 


are then automatically satisfied, and are equivalent to requiring that the 
right-hand side of (20.9) must not depend on which surface bounded by 
the curve given on the left-hand side is taken. Indeed, if one has two such 
surfaces, then the difference in the integral [/',,,dS,,, over them will be 
given, by Gauss’ theorem, as 


(20.13) A f F,dS,y = J €cdun( OF y| Oxy) de 


where the integral on the right is taken over the volume between the two 
surfaces, and (20.12) is necessary to guarantee the vanishing of this 
expression. 

The identification (20.10) gives rise to some observable effects whose 
importance for the quantum mechanical concept of interaction was first 
realized by Aharonov and Bohm, and which are bound to startle anyone 
who has been brought up with classical electrodynamics. For example, 
if a coherent beam of electrons is taken around both sides of a solenoid, 
an interference pattern is observed which will shift continuously with 
continuously varied flux F through the solenoid. This is predicted by the 
identification (20.10), because if one considers the part of the % function 
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at the point of interference which is a linear superposition of two 4 
functions corresponding to path 1 taken around one side and path 2 
taken around the other side of the solenoid, 


(20.14) 
$= dich+doe;  B, = ef Adg; fr = -e f Adq 
1 2 


then the interference between the two beams will depend on 


(20.15) Bo—Bi = ef Adq = 


where the integral follows the closed path formed by paths 1 and 2 
around the solenoid, and /' is consequently the entire flux through the 
solenoid. 

Since this effect will occur even though the electron beam may never 
enter any region in which the electromagnetic field is unequal to zero, 
Bohm and Aharonov have argued that in quantum mechanics potentials 
acquire the status of observables which they do not have in classical 
electrodynamics, pointing out that this conclusion is unavoidable if one 
wishes to adhere to the concept of local interactions as a basic require- 
ment, 

One can, however, formulate quantum electrodynamics without the 
use of potentials, if one admits the kind of nonlocality inherent in the 
very notion of a path-dependent phase, as has been shown by Mandel- 
stam. It would appear, then, more reasonable to accept the Bohm- 
Aharonoy experiment as an indication of a profoundly nonlocal feature 
acquired by ys functions in an electromagnetic field. This feature can be 
extracted from the y function (20.7) in yet another fashion. By dif- 
ferentiation one obtains 


(20.16) (A,/dx,) = e'[(A/Ax,) +ix,]O2 = e'B[(A/Ax,) —ieA,] O°. 


It follows that if ®, satisfies any equation involving the operator of 
momentum-energy P,, = i(0/d,), then ©? will satisfy the corresponding 
equation in which P,, haa been replaced by P,,+eA,,. On the basis of the 
identification (20. 10), one would then have to Ronabel: that the ys function 
® always satisfies the same equation, whether there is a field or not, and the 
whole effect of the field is in making the phase nonintegrable. This is equiv- 
alent to having the “ordinary” part ®° of the % function, namely the 
part having a definite phase, satisfy the equation with P,, replaced by 
P,,+eA,, In particular, the Dirac equation for an electron i in an electro- 
magnetic field now reads 


(20.17) Yp(Pyt+eA,) ye = mp 
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where ¢ is now meant to have a definite phase at each point in space and 
time. 

It will be noticed that this equation is invariant under phase trans- 
formations involving single-valued functions (x) 


(20.18) yp > pela 


provided transformation (20.18) is accompanied among the vector 
potentials by a gauge transformation 


(20.19) A, > A,+(@A/ax,). 


This invariance property can be made manifest if one introduces the 
explicit path dependence of ® 


(20.20) @(x,P) = B(x)exp[—ih | A,(€)dé,,] 


3s 


and the gauge-invariant derivative 
(20.21) 0, P(x,P) = lim {[O(~+dx,,P’)— P(x, P)|/dx,} 
dxy—>0 


where the path P’ is obtained from P by giving it an extension da, in x, 
direction, i.e. P’ passes through the end point x of the path P. The Dirac 
equation can then be written 


(20.22) iy, 0,0 = m®. 


It should be kept in mind that the operators 0,,, 0, do not commute if a 
field is present, 


(20.23) (8,,2,—0,0,)® = —ieDF',,. 


As a calculational aid, the electromagnetic potentials are, of course, 
always extremely convenient when one seeks to actually solve Eq. 
(20.22). 

The rule of replacing P,, by P,, + ¢A,, in presence of an electromagnetic 
field has been known for a long time, and its success in giving right 
answers in empirical situations where the representation of the electro- 
magnetic field by a classical potential is meaningful, is well known. The 
comparison of the energy levels in a Coulomb field resulting from (20.17) 
with observation can be found in practically all texts on quantum 
mechanics. The good approximations to reality provided by this 
empirical rule are the more astonishing in view of the number of arbitrary 
features characterizing this ‘‘derivation’’ of that rule by the line of 
reasoning employed above. 

Perhaps the most serious shortcoming of the argument leading to the 
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identification (20.10) is the complete arbitrariness of the value of the 
parameter ¢, identified as the electric charge of the particle considered, 
which could be set equal to zero, for example, thus making the entire 
discussion up to this point an empty exercise. In other words, no suf- 
ficient reason for the existence of either charged or uncharged particles 
has thus far been advanced. It is, in particular, not at all clear why one 
could not have, in analogy to electrons satisfying (20.17), neutrinos 
coupled to the electromagnetic field satisfying the correspondingly 
modified neutrino equation o,(P,, +¢A,,)} = 0. 

In this connection, it seems worth recalling a curious argument by 
Dirac based on the fundamental indeterminacy modulo n2z7 (n an 
integer) in the phase of any complex number. Considering the single- 
valued part ®° of any # function, Dirac argues that the change in phase 
around a small closed curve must be small and cannot therefore be a 
nonvanishing multiple of 27, because in the limit of an infinitesimal 
circuit this would conflict with the continuity of ®. There is an excep- 
tional case, however, when ®° vanishes, since then its phase does not 
have a meaning. Since ®° is complex its vanishing will require two 
conditions, so that in general the points at which ®(x) vanishes will lie 
along a “nodal line.” From continuity, one can now only infer that the 
change in phase along a circuit around a nodal line must go over into 
n2r in the limit. This integer n will thus be a characteristic of the nodal 
line, and its sign can be associated with the direction of the circuit, which 
in turn may be associated with a direction along the nodal line. If one 
considers now a large circuit in space with a number of nodal lines 
passing through it, then the total change in phase along the curve is 


(20.24) AB = 2n Dn, +e ff B-dS. 
i 
Applied to a closed surface, (20,24) must vanish, 
(20.25) Yn, = —(e/27) ff B-dS (c.s. means closed surface) 
ea. 


If } »; 40, some nodal lines must have end points inside the closed 
es 


surface. Thus the endpoints of nodal lines, if they exist, must be the same 
for all % functions, and represent sources of magnetic flux 47f = (27m/e), 
where f is the strength of the magnetic monopole at the end point, 


(20.26) f = (n/2¢). 


These considerations do not, of course, show that such nodal lines with 
or without end points must exist in nature. As in case of electric charge e, 
the argument leading to the prediction of f does not contain sufficient 
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reason for the existence or nonexistence of any finite value of n. All 
experimental evidence available to date indicates that magnetic mono- 
poles do not exist in nature. 

Equation (20.26) is, incidentally, invariant under inversion of co- 
ordinates and under reversal of motion only if the nodal characteristic n 
changes sign under either transformation, independent of whether e does 
or does not change sign under either transformation, because of the 
opposite transformation character of electric and magnetic fields. 


NOTES 


Probably the first attempt to find the reason for the electromagnetic 
interaction in the invariance of the Schroedinger equation under phase 
transformations is due to London [1]. 

Dirac [2] used the indeterminacy modulo 27 in the phase of any single- 
valued y function to speculate on the possible existence of magnetic 
monopoles. 

Aharonov and Bohm [3] drew attention to experiments which demon- 
strate directly the nonintegrability of phase in presence of electromag- 
netic fields. The experiment of Aharonov and Bohm with magnetic flux 
enclosed by a split electron beam was actually performed by Chambers 
[4]. 

Mandelstam [5] has given a formulation of electrodynamics without 
potentials, and shown in which sense the path dependence of phase 
implies a basic nonlocality of the y function in presence of electromag- 
netic fields. 

Readers who wish to review at this point the comparison with experi- 
ment of the solutions of Eq. (20.17) as applied to atomic hydrogen may 
find the slim booklet by Series [6] particularly concise and comprehensive. 
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*SECTION 21 


The Idea of the Compensating Field 


Since the requirement of invariance under phase transformations 
(20.18) necessitates the presence of some field which compensates by a 
gauge transformation (20.19) the effect of (20.18) on the equation 
governing the % function, some authors have attempted to elevate a 
generalized form of gauge invariance to a fundamental principle, and 
have sought to find in the idea of the ‘‘compensating field” the raison 
d’étre for actually occurring interactions. The formulation of this 
approach is greatly aided if one uses as starting point a so-called action 
principle 


(21.1) J ou, (Ob/Ox,)]d*a = Extremum 


from which flow the equations governing the % functions as Euler- 
Lagrange equations of this variational principle, 


(21.2) (8L/5p) = (AL/Of) — (0/Ax,) [8L/0(04/Ax,)] = 0. 


For example, the “Lagrangian” L giving rise to the Dirac equation and 
its adjoint by the recipe (21.2) is 


(21.3) Ly = ify,,(8y)/0x,,) — mrp. 


It should be understood that introduction of (21.1) is a purely formal 
device, and the step leading from (21.3) by (21.2) to the Dirac equation 
can in no sense be considered as a “derivation” of this equation. 

Suppose now one wants to insist on the invariance of Z under unitary 
transformations of the type 


(21.4) U = exp [ie,(«)8,] 


where 8, («=1,2,...,) are » hermitean operators and e,(z) n real 
functions of space and time. The idea is to associate the operators S, with 
internal properties of elementary particles, such as electric charge, 
isospin, ete. For example, one may visualize two spatially separate 
observers with different conventions about the labeling of nucleons as 
neutrons and protons looking at the same event involving strong inter- 
actions which are invariant under changes in these conventions, i.e. 
177 
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invariant under rotations in the abstract isospin space. The generators 
of space time transformations associated with dynamical rather than 
internal properties of elementary particles are expressly excluded from 
consideration at this point. The coordinates x refer to a flat space-time 
continuum in which displacements, in particular, are integrable. If one 
makes the parameters characterizing space-time transformations, such 
as translations, rotations and Lorentz transformations, coordinate 
dependent, then the generators S, will, in general, not commute with the 
parameters ¢,(x), and one is forced to consider nonintegrable, curved 
spaces. This question will be taken up in some detail in Section 22. 

The transformation (20.18) is the special case with n = 1, S = el and 
(x)= A(z). The Lagrangian (21.3) is then not invariant as it stands, 
because 


(21.5) U-*(/ax,,) U = (/Ax,) + U-(0U/2x,). 


This deficiency of Lp can be repaired, however, if one introduces a 
compensating field B, which transforms as 


(21.6) Bi, = U"B,U +(i/g) U-"(0U/ax,) 
so that, if transforms as 

(21.7) y=07g; P= UP 
then 

(21.8) 


Ly = ify, [(0/0x,)—igB,\yp— mays = if’ y,[(0/Ox,) — ig By |p! — maf" 
is now invariant under the transformation (21.4). In the special case 
mentioned above, the still arbitrary coupling parameter g will then have 
to be identified with the numerical value of the electronic charge e, and 
B,, with the vector potentials A, of electrodynamics. 

Tt is evident that one cannot introduce scalar fields through this 
requirement of gauge invariance. Any compensating field must, by 
necessity, be a vector field. Therefore, if compensating fields are accepted 
as the primary agents of interaction between fermions, the pion field, 
which for a long time was thought to be the “glue” which holds the nuc- 
leons together, will have to be demoted from the ranks of fields whose 
quanta are considered as elementary particles. Any gauge theory of 
strong interaction requires, in principle, vector mesons as agents of 
interaction, and in such a theory one must seek to obtain pions as 
composite particles, for example as bound states made up out of nucleons 
and antinucleons as had originally been surmised by Fermi and Yang. 

The transformation formulae (21.5) and (21.6) are, in general, rather 
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complicated, since they depend on the C.R.s between the operators S,. 
One has the expansions 


(21.9) 
U-"(QU/Ax,) = i(Be_|8x,) Sq — (i?/21) €g(dep/02,) [Sq Sp] 
+ (i8/3!) e, ¢p(A,/22,) [Sas [Sp,S,]]— +... 
(21.10) 
UB, U = B,—ie,[8,, By) + (i?/21) €s €{Sq, (8p, Byl]— +... 


If one were to regard local gauge invariance as embodied in the trans- 
formations (21.4) for any operators S, as compulsory, one would in fact 
be faced with an embarrassment of riches regarding possible compensating 
fields. Some physical reasons are needed to restrict the large variety of 
possible formal choices for compensating fields. With an eye on such 
reasons, to be given in the form of some examples later, one can attain a 
substantial reduction in complexity if one restricts consideration to 
operators S,, which satisfy C.R.s of the type 


(21.11) [Sa»Sg] = Oza, 5, 


The “structure constants” C,,,,, of the set of transformations S, are then 
independent of the representations used for the S,, and satisfy the rela- 
tions 


(21.12) Cupy + Chay = 0 
and 
(21.13) Cape Coys + Cpye Oca + Cyne Cops = 0 


where (21.12) is a consequence of definition (21.11), and (21.13) follows 
from the Jacobi identity satisfied by triple commutators. Transforma- 
tions that are isomorphic to the Euclidean rotations in three dimensions 
and four dimensions, and Lorentz transformations are special cases of 
(21.11). 

Introducing without restriction of generality new field variables B,, ,,. 
which commute with all S,, by 


(21.14) B, = 8. Bey 
one can consolidate the two expressions (21.9) and (21.10) into 
(21.15) U~*(QU/0x,,) = 8,[i(A¢,/Ox,,) — (7/2!) eg(Ae,/82x,,) Opya 

+ (13/3!) €g€,(0e5/0%,,) CyseCpey— +--+] 
(21.16) UB, U = S8,[Ba,y— tp By,u Cpye 

+ (i?/2!) pe Bs, pCyse Ong +. -:): 
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Therefore, (21.6) is satisfied for infinitesimal gauge transformations if 
the B,,,, transform according to 
(21.17) 
8Bay = Bau — Bap = 1€p By, u Cpya + (1/9) (¢a/O%,). 


This formula gives a clue to the construction of the gauge-invariant 
Lagrangian L, for the field B,, ,, which will give the field equations satis- 
fied by the compensating field, and which should in the special case of the 
electromagnetic field coincide with the well known Lagrangian 


(21.18) Ly = -P Fy 
containing only the gauge-invariant fields 
(21.19) F,,, = (0A,/0x,)—(04,,/0x,). 
Denoting this unknown Lagrangian with 
(21.20) 
Lg = L5(Bays Bepy)s — Ba,uiv = (0Bu,u/Oy) 


one requires as a consequence of the invariance under transformations 
(21.17) 


(21.21) 

8Ly = (OLp/0By,,) 8Bu, y+ (OLp/OBa, uly) Ba, uv = 9- 
Substituting (21.17) and (note that 6 and 0/dx, commute) 
(21.22) 

By, piv = 4p By, wiv Opye + *(8ep/2,) By, w Opya + (1/9) (07 €,,/02, Ax.) 
and collecting coefficients of ¢,, J¢,/0x, and 0° «,,/0x, 0x, one obtains 
(21.28) CupylBp, w(OLp/OBy, y) + Bp, yiy(@L5/OBy, w\v)] = 0 
(21.24) (@L;/0B,,,) +19O apy Bp, (OLp/OB,, y\,) = 0 
(21.25) (AL, /OBy, pp) + (@Lp/OBg, 4.) = 0 


Equation (21.25) can be satisfied only if the derivatives of BenterL;in 
the combination By, tujy) = Be, vp — Be, wiv» and, from (21.24), it follows 
further that B,,;,),) enters Lp only in the combination 


(21.26) 
Gy» = Bajuii— (1912) CByal Bp, p By, v— Bp,» By, u) = — Ga, vps 


Equation (21.23) is not sufficient to determine L, uniquely. However, 
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if one wishes to obtain a general expression which contains (21.18) as 
special case, one must introduce the “field strengths” 


(21.27) 

Gy = (0B,/Ax,) —(0B,/Ax,) —i9(B, B, —B, B,) = 8,4, 
and write 
(21.28) 


oY 


i 


Ly = —}traceG,, Gy, = —26,, uv %,,, trace S,, Sg 


= 4G, uy Fp, wy Coys Coyp- 
The proof that L, does indeed satisfy (21.23) is cumbersome and will be 
omitted here. 

The entire Lagrangian describing the mutually interacting ¢ fields and 
B-fields is now 
(21.29) L=Ly+L, 
where the derivatives of }s occur in the combination [(0/0x,,) —igS,, By, . 
so that 
(21.30) [AL p/0(Op/Ax,)1S. = (i/g) [2Lp/2B,, u1- 


Incidentally, the arbitrary coupling parameter g can alternatively be 
introduced as a factor with which L,, may be multiplied without changing 
the conclusions of this section. Formally, this is equivalent to introducing, 
instead of By, the fields B, = gB,, as variables, so that 


O.% = [(0/dx,)—iS, By ,Jy and Lg = 9 'L,(B’). 


Now, the invariance of L under an infinitesimal gauge transformation, 
transforming B,, ,, according to (21.17) and ¢% according to 


(21.31) by = ie Sap 

(21.82) df = —ie, Sy 

(21.33) 8(Oyp/0x,,) = 1(Ve,/02,) Sy + te, Sq (Oyp/Ax,) 
leads to a conservation law of “current” 

(21.34) (AJ, ,/0x,) = 0, 


which can be derived as follows. 
Since LZ, had already been chosen so that 6L, = 0 under that trans- 
formation, one need write down only 
(21.35) 
BL = (ALp/Of) 5) + (AL p/Op) 8p + [AL /8(AY,]Ax,)]3(0/2xr) 
+ (AL p/0By, ») 8Bo, ==an0} 
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Use has been made here of the fact that Ly contains neither (d%p/ dx,,) nor 
B,, |». The first term vanishes because of the field equation 


(21.36) (8L/8p) = (ALp/af) = 0 
and the remainder reads upon substitution of (21.17), (21.31), and (21.33): 
(21.37) 
€x{(OLp/Op) 8, + [BL p/O(Oyp/x,,)]S,(Op/0x,) + Cup, Bp, y(OLp/0B,, ,)} 
+ (¢,/0x,) {AL p/0(0y,/2x,)|S_ ip — (i/9) (@Lp/OB,, ,)} = 0. 
The second bracket vanishes on account of (21.30) and for the first 
bracket one can exploit the field equations 
(21.38) 
(8L/5y) = (AL,)/044) — (8/2x,) [AL p/O(Op/Ax,,)] = 0 
and 
(21.39) 
(8L/8B,, .) = (0L)/aB, ,p) + (0L,/0B,, ,) —(0/Ax,) (OL/AB,, yy) = 0 
to yield 
(21.40) 
(8/0x,) {(0Lp/2(p/2x,,)]8,,} + Cag,[— Bp, ,(OL5/0B,, ,) 
+ Bp, ,(0/0x,) (@L/0B,, ,.1,)] = 0 


At this point, Eq. (21.33) allows one to substitute for Cg, Bg, ,(0L,/0B,, ,) 
and write (21.40) 


(21.41) 

(2/2, {{AL p/0(0/@z,)1S, $+ Ongy-Bp, (OL n/0B,,»4,)} = 0 
which is of the form (21.34) with the current density 
(21.42) 
To, = [AL p/A(Op/Ax,,) |S + Cnpy Bp, (OLp/OBe, vp) = Ja, uli!) +5e,u(B)- 
Equations (21.39), governing the field B, can be cast in the form 
(21.43) (@Lp/@B,, .) ~ (8/2,) (BL. p/OBy, yp) = ida yl) 


These equations are, except in case C.,,, = 0 which corresponds to a type 
of interaction resembling electromagnetism, essentially nonlinear, be- 
cause the compensating field B acts as its own source in a manner indi- 
cated by the presence of the second term on the right-hand side of the 
current (21.42), The vector particles which in a full quantum theory 
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must appear as the carriers of interaction are, therefore, except in case 
C.py = 0, expected to share with the primary source particles the sym- 
metry property which gives rise to their existence. Thus any vector 
mesons which mediate the interaction caused by the isospin symmetry 
of nucleons must themselves carry isospin. Particles associated with a 
one parameter symmetry are exceptional in that they do not possess the 
relevant source property. Thus photons do not carry an electric charge. 
If one specializes to invariance under rotations in isospace, so that 


(21.44) 

S,.= 7, (« = 1,2, 8); Cupy = teupys traceS, Sg = 28,8; 
one obtains the equations characterizing a vector field B,, first introduced 
by Yang and Mills in an attempt to account for some of the facts of 


strong interactions in terms of vector mesons coupled to the isospin of 
baryons, 


(21.45) Go,uv = Guy = By +9(B, x B,) 
(21.46) 
0, % = [(0/0x,)—ig(T-B,)]4; (0,0, -0, 0.) = —ig(G,,-T) pb 
(21.47) a ae 
(21.48) 
Tay = 4, = iby, TH+i(G,, x B,) = ify, Th+i(By)) x B,) 
(21.49) (8L,/8B,,) = (0G,,,/0x,) —igJ,,. 


The compensating field has the generally attractive feature of inde- 
pendence from the particular form taken by the operators S, which 
generate the symmetry property attached to the y field. It depends only 
on the structure constants C,,,,, which are the same for all representations 
of the operators S,. Physically, this means the compensating field B is 
the same for all y fields that possess the particular kind of symmetry 
represented in terms of the operators S,. Thus there is only one universal 
electromagnetic field which compensates by gauge transformation the 
effects of the one parameter phase transformation generated by the 
operator of electric charge. 

A number of schemes have been proposed linking the empirical sym- 
metries of the strong interactions which reveal themselves through con- 
servation of various attributes such as baryonic charge, hypercharge, 
and isospin (see Section 29), with the existence of various types of vector 
meson fields, each field being generated in the manner described above 
as a compensating field B from the requirement of gauge invariance, so 
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that the mentioned conservation laws are identified with equations of the 
type of (21.41). Notably, a scheme by Sakurai postulates the existence of 
three types of vector mesons which, respectively, are the dynamical 
agents attached by the idea of the compensating field to the two one- 
parameter attributes of baryonic charge and hypercharge, and the one 
three-parameter attribute of isospin. 

Similarly, there exist a number of proposals to account for the weak 
interactions by the intermediary of some vector meson generated as a 
compensating field by the particular gauge symmetries characteristic 
of the particles engaging in these interactions. 

However, all attempts to identify actually observed vector mesons 
with any of the possible compensating fields B that may mediate other 
than electromagnetic interactions encounter a disappointing feature of 
this theory. The formalism developed above cannot accommodate com- 
pensating fields B which contain in the Lagrangian terms of the type 


(21.50) — (46/2) Bap Bu, p (uo @ constant) 


without destroying the general pauge invariance which has been the very 
motivation for this approach. On the other hand, such terms are needed 
if one wishes to describe any of the actually observed vector mesons 
other than the photon, because only by inclusion of terms (21,50) will 
the field equations be of the form 


(21.51) (2G,,/@2,) —p3B,, = igi, 


needed if the interaction is to give rise to short-range forces and, therefore, 
to mesons with finite rest masses in accordance with observation. 

Now, some authors have used the fact that the derivation of the expres- 
sion (21.42) for the current and the conservation law (21.41) are not 
affected by the addition of a term (21.50) to Lz as an excuse to consider 
“partially gauge-invariant” theories in which a field B, not longer 
deserving the name of ‘‘compensating field,” is coupled to a conserved 
current after the fashion of (21.51). Such an approach loses its aspect of 
complete meaninglessness if one keeps in mind the hybrid nature of the 
theory developed above. The field B up to this point has been conceived 
as a classical field. In a consistent quantum theory both the fields and 
B should always appear in conjunction with the creation and annihi- 
lation operators which give rise, in particular, to so-called vacuum 
polarization effects. One can then argue with Schwinger that, in case of 
sufficiently strong coupling, the vector mesons may acquire, even in a 
strictly gauge invariant theory, the propagation characteristics of 
particles with finite rest mass. 

From an intuitive point of view this can be made plausible by the 
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following line of reasoning. In actual electrodynamics the comparative 
weakness of the coupling constant (e* = 1/137) guarantees the stability 
of the photon against possible disintegration into two or more atoms of 
positronium, because the binding energy of positronium is of order 
—e*m as compared to the restmasses m of its constituents. Similarly, any 
external charge Q brought into the electrodynamic vacuum will cause a 
polarization into virtual positroniums only in its vicinity, and will (apart 
from a possible renormalization of its numerical value) retain an un- 
compensated amount which acts as a source of a long range coulomb 
field, If the strength of the coupling were now allowed to increase (and it 
should be possible to do this analytically without destroying the gauge 
invariance, which is a structural property of the theory independent of 
the parameter e¢*), at a certain critical value of order e? ~ 1 an entirely 
different situation would arise. The vector meson would become unstable 
against disintegration into various bound states of the source field and 
acquire propagation characteristics usually associated with massive 
vector mesons that are unstable against decay into two or more pions, 
which in turn may be considered as bound states of nucleons and anti- 
nucleons. Any external “‘charge”’ introduced into this kind of vacuum 
would induce a chain of polarization events which would effectively 
transport the original charge to spatial infinity, leading to complete com- 
pensation of the original charge in any finite volume. Consequently, no 
long range field of the ‘‘charge” could be maintained, and this amounts 
again to the absence of vector mesons with vanishing rest mass in this 
case. 

The analytic penetration of this attractive idea, which has been 
promulgated and shown to be feasible in some simplified models by 
Schwinger, has not been completed at the time of writing. Therefore, all 
gauge theories of strong and weak interactions remain stalled in a state 
of animated suspension until this crucial point, of whether massive 
vector mesons can be consistently accommodated in a strictly gauge- 
invariant theory, has been settled. 


NOTES 


Yang and Mills [1] first attempted a theory of strong interactions in 
terms of a vector meson field which is a compensating field needed to 
guarantee coordinate dependent invariance under rotations in isospace. 

The idea to consider pions as bound nucleon-antinucleon states is due 
to Fermi and Yang [2]. 

Sakurai [3] has attempted a theory of strong interactions by invoking 
three types of compensating fields, corresponding to three types of vector 
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mesons, one generated by a three-parameter group of transformations 
and the other two each generated by a one-parameter transformation 
group. See also Schwinger [4] and Lee and Yang [5]. 

Roman [6] has proposed a scheme of compensating fields which 
incorporates the weak interactions. See also Salam and Ward [7]. 

Glashow and Gell-Mann [8] have considered some consequences of 
so-called “partially gauge-invariant’”’ theories of interaction. 

Schwinger [9] has given a formal, but somewhat unrealistic, example 
demonstrating the consistency of compensating fields having finite rest- 
mass with strict gauge-invariance in case of sufficiently strong coupling. 
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*SECTION 22 


Gravitation as a Compensating Field 


The only other interaction besides electromagnetism which does not 
require field equations corresponding to particles of finite rest mass, and 
is, therefore, already on the level of classical field theory free from any 
obvious inconsistency with a gauge-invariance principle, is gravitation. 

There is, of course, one fundamental difference between electro- 
magnetism and gravitation. Whereas electromagnetic theory can be 
formulated so that it becomes globally invariant under Lorentz trans- 
formations, in the presence of gravitation one can, in principle, only 
require local Lorentz invariance, for the simple reason that, in the 
presence of gravitation, inertial frames of reference are, in general, 
accelerated with respect to each other if they are some finite distance 
apart. The description of an event in space-time requires, therefore, two 
labels, namely the distance x of the event from the origin of the inertial 
frame in which the event is described, and the label w which tells where 
the origin of that inertial frame is located in an underlying curvilinear 
coordinate system. 

For the purpose of illustration consider two observers (0) and (0) 
located at the origins of two inertial frames, respectively. The observer 
(0) is placed at the center of the earth, and the other observer (0) is 
oscillating in a tunnel drilled through the center of the earth, as drawn 
in Fig. 22.1. In the underlying curvilinear continuum the two observers 
will describe worldlines which may be rendered graphically as indicated 


6) 


(0) 


Fia. 22.1, Example of two inertial frames accelerated with respect to each other. 
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(0) (0) 


Fia, 22.2. The world lines of observers (0) and (0) described in an underlying curvilinear 
continuum, 


in Fig. 22.2. It is intuitively clear that the two frames carried along by 
the two observers cannot be connected by a Lorentz transformation, 
except locally whenever the worldlines intersect, because only at the 
instant when the two origins coincide are the two frames in unaccelerated 
motion with respect to each other. In fact, the displacement operation 
becomes nonintegrable under these conditions, because the two inertial 
coordinate frames x and & carried along without rotation by each observer 
will be rotated with respect to each other between successive meetings 
at u, and uz (say), as indicated in Fig. 22.2. 

These considerations can be made formally precise through the 
introduction of a set of 16 functions hK(u) which allow, at each con- 
tinuum point w,, the transformation to a local inertial coordinate frame 
x, by 


(22.1) (dx*/OuH) = hk(w) and (Aul/Aa#) = hit(w) 
sothat  hihy = 8%. 


(By convention, Latin indices refer to components in local inertial frames, 
Greek indices to components in the underlying continuum.) Since the hi 
are not required to satisfy any integrability condition, i.e. 


(22.2) Miv—Miyp #0; ky, = Ohk/Ou", 


they may be considered as a set of 16 independent given functions 
representing the properties of the given gravitational field in which 
observers tied to inertial frames may find themselves. 

The purpose of this section is then to show how the requirement of 
local Lorentz invariance induces dynamical restrictions on the hh in the 
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form of field equations which the hf must satisfy, in analogy to the 
restrictions imposed on the vector potentials of electrodynamics by the 
requirement of gauge invariance leading to Maxwell’s field equations as 
a possible set of such equations. 

Contact with the standard formalism in the theory of gravitation can 
be made by noting that if the metric of any inertial frame z is taken to be 


(22.3) 9x = 8,  sothat ds* = 3,da'dz', 
then the metric g,,, of the continuum wu, defined by 

(22.4) ds* = g,,,duhdu” = (Aa/Aw) (Bar*/Ou) 8,, dul du’ 
can be expressed in terms of the functions hii as 

(22.5) Juv = Mi, WES, = Nyy, hk 


and therefore all other quantities characterizing the geometry of the 
continuum w, such as the affinities le and the curvature tensor Tees 
are expressible in terms of the field variables h¥. 

The question is now whether the functions ae representing gravitation 
can be related to a compensating field B whose existence will become 
necessary when one requires invariance under local Lorentz transforma- 


tions which connect any local inertial frame x* with another local frame 
# by 


(22.6) w= acktdu)a; Mu) = —H(u) 
affecting the functions 4/ themselves according to 
(22.7) Sh = ef(u)ht, 

and transforming any ¢ function according to 

(22.8) dys = fe¥(u) My. 


Here the generators M,, representing the Lorentz transformation satisfy 
the C.R.s (see Appendix 4) 


(22.9) (Mi Minn] = 3C% mn Mars = May = —Mig 
where 
(22.10) CB mn Ory = 4(Sy, 515 — 845 Bui)» 


Indeed, any action integral 


(22.11) I = [ Lip, (0p/axt]d*x 
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governing the dynamics of the function in an inertial frame reads in 
curvilinear coordinates 


(22.12) 
T= f Lip(u), ipl) (@p/aw')|hatu = f Ply, (Op/du), hg} d*u; 
h = det|ht|; L =hL; 


and is not invariant, as it stands, under the transformation (22.6). In 
analogy to the development leading to (21.8) this deficiency can be 
repaired, however, if one postulates the existence of a compensating 
field 


(22.13) Bi(u) = —Bi(u) 
transforming according to 
(22.14) 
SBi = 4C%, y€"(u) BY + (Be4/Out) = ek, Bi! + &j, Bi” + (de /Ou) 
and replaces Y by 


(22.15) L = Lh, 0,4, hh) 
where 0, stands for 
(22.16) Aub = [(0/du")— 4B" My). 


The relationship between the compensating field B and the gravita- 
tional field A is now obtained when one notices the identity of the “gauge- 
invariant derivative” (22.16) with what is commonly known as the 
“covariant derivative ” of ys. To see this write down (22.16) for the special 
case of a tensor field 4 with components % in the local Lorentz frame, 


(22.17) O,~9 = (Op /OuH)— Be bi — Bibi 
and go over to the components 1/?? of jin the curvilinear frame by 
(22.18) pre = hahgyh; pl = hi hd per. 


After multiplication of (22.17) with A4h? and utilization of bf = dp" as 
well as hk ht = 54, one finds 


(22.19) AAAS A, p4 = (Bro /OwH) + P'e,.yre+ Te, yr 
with 
(22.20) rah = neki, — Bey 
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where 
(22.21) Be, = hh, Bi. 


If one can now show that the symbols I%,, introduced in (22.20) are 
identical with the affinities 


(22.22) Dey = 49°? Gorip + Ipolr —Iurio) 
then one has established the identity of (22.19) as the covariant deriva- 
tive. To this end write down Eq. (22.19) for the metric tensor itself, 


(22.23) Nhs Ay(89) = Gpojp+Tu97? +13, 9°". 

Since, by definition (22.17) and the antisymmetry of the fields Bit in the 
indices i and k, one has 

(22.24) a,(84) = —Bi—Bii = 0 

the right-hand side of (22.23) must vanish. One can solve then uniquely 
for the I’, provided one assumes them to satisfy 


(22.25) Te, = I, 


and obtains then Eq. (22.22). Therefore, Eq. (22.23) is nothing but the 
covariant derivative, denoted g,,,,,, which vanishes, and (22.19) is 
recognized as the covariant derivative, 


(22.26) HAAS Op = per. 


Equations (22.20) and (22.21) contain, therefore, the desired relationship 
between the compensating field Bi and the gravitational field h/. 
As in the corresponding case treated in Section 21, the Lagrangian for 


the field B, which must be added to Y to give the entire Lagrangian 
(22.27) L=LP+L, 


from which the field equations for the gravitational field h are obtained 
by variation 


(22,28) (8L/3hi,) = 0 


is not uniquely determined by the requirement of gauge invariance. All 
one can say is that “, must be of the form 
(22.29) Ly = Ly(hi, Fs, 


where Fi, is defined in terms of the compensating field by 


(22.30) Fil, = (OBY/dut) — (AB /u") — 4.0%, «4 (BP BY — BY Be). 


One can easily show by straightforward computation, using relations 
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(22.20) and (22.21), that Fij, is related to the well known curvature 
tensor 


(22.31) RK, = (OP%,/@u") — (OL, Out) +13, 5,-P Toy 


by 


(22.32) FE, = WARERY 


The gravitational field equations proposed by Einstein, in particular, are 
obtained if one chooses 


(22.33) Ly = «OWR 
with 
(22.34) R= gh Ry = MEL; Ruy = Rive 


where the coupling parameter «, as the electric charge in the corres- 
ponding case, remains completely undetermined by the theory at this 
stage. 

For further details the reader is referred to the work of Utiyama. 


NOTES 


Utiyama [1] first developed fully the general formalism of compensat- 
ing fields and applied it, in particular, to the case of the gravitational 
interaction. See also the review article by Adamskii [2]. 
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——— SECTION 23 


The Starting Point of Quantum Electrodynamics 


The considerations of Sections 20 and 21 suggest, though do not prove, 
that in a complete quantum theory of fermions interacting through the 
medium of the electromagnetic field, the Hamiltonian of uncoupled 
fermions and photons, reading in the notation of (19.54) and (18.79), 
under omission of any zero-point energies, 


(23.1) H° = HY,+Hy 

(23.2) Hg = z x Q(Ns(k,r)+N_(k,r)]; 2 = +Vk®+m* 
(23.3) HY = x x wn(x, 8); w = |x| 

should be augmented by an interaction Hamiltonian 

(23.4) = -{ Ajuda 


where now both 4,,(x) and j,,(7) are given, as in (18.86) and (19.56), in 
terms of creation and annihilation operators b*, b and a*, a, respectively. 
This interaction Hamiltonian is a function of time, containing eight basic 
terms, each linear in the photon operators b* and 6, and bilinear in the 
electron-positron operators a7, and/or a,. The integration over q in (23.4) 
yields for each of these terms a 5 function in the momenta of the involved 
particles, incorporating the conservation of momentum in all transitions 
between states caused by the interaction. With the notation (see Appen- 
dix 6) 


(23.5) 3(k) = [1/(2n)%) f edgy 


one finds 


(23.6) H(t) = —[e(2n)3/VV] 5 P» Dy = x > (1/V2w) 


x [eu(S) Ay (k’, 1) y, A4(k, 7) (xe, 8) at (k’, 7’) a.(Ik, 1) 

x e-Book —x) + €8(9) A, (k', 1’) y, Ay (k, 7) 

x b* (x, S) at (k’, r’) ay.(Ik, 1) ef -2+0)t § (kk! — Ie + xe) 
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—€,(8) A_(k’, 1’) y, A-(k, 7) b(x, 8) a*(k, 7) a_(k’,r’) 

x e UO Ota) §(e’ — Kk +x) — €#(S) A_(k', 7’) y,, A_(k, 7) 
x b* (x, 8) at (Kk, 7) a_(Ik’, r’) e2'—2-w)t 8k’ — x — x) 
+<,(S) Ay(k’, 7’) y, A-(k, 7) b(%, 8) at(k’, r’) aX(I, 7) 

x eO42-o)t S (Ic! +k — xe) + 8 (S) A, (K’, 2") y, Ak, 7) 
x b+ (x, S) at(k’, r’) at(k, 1) el 242+) Sk’ +k +x) 
—«,(8) A_(k’,r’) Yp x(k, 7) b(, 8) ay (kk, 7) a_(k’, 7’) 

x e WO O40) Sk’ +k +2) — eX (S) A_(K’, 7’) y, A+(k, 7) 
x b+ (x, 8) a4.(k, 7) a_(kk’, r’) e UP +2-t Sk’ 4k — x)]. 


This expression for the interaction Hamiltonian of quantum electro- 
dynamics has been written out in all detail to impress upon the reader 
the formidable computational task faced whenever one tries to extract 
from it information about the outcome of possible experiments. 

Since the inception of quantum electrodynamics two avenues of 
attack on the mathematical problem of how to disentangle the ramifi- 
cations of the specific form (23.6) for H’ have yielded results, leading in 
many instances to quite fabulous numerical agreement between predic- 
tion and observation, namely: 


(i) the development of computational techniques making the appli- 
cation of so-called perturbation theory to H’ tractable, and 

(ii) the exploitation of symmetry properties of H’, giving rise to a 
number of so-called selection rules and other general consequences aiding 
the computation of experimentally accessible quantities such as cross 
sections and lifetimes. 


If the separation of the Hamiltonian H = H° +H’ into an uncoupled 
and an interaction term is at all meaningful, the results inferred from 
symmetry considerations ought to be independent of the numerical 
value of the coupling parameter e. The same cannot be said for pertur- 
bation theory whose validity appears to be dependent on the assumption 
of ‘weak coupling,’ meaning e” < 1, and is even in this case beset by a 
number of perplexities which required development of rather daring 
mathematical techniques known as “renormalization” procedures. The 
case of strong coupling, meaning e” > 1, has thus far resisted all attempts 
aimed at its mathematical penetration, despite massive efforts that have 
been brought to bear on this problem, The task of developing a strong- 
coupling quantum electrodynamics has become more urgent since the 
advent of novel points of view, touched upon in Section 21, which make 
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it appear probable that all fundamental interactions are, in fact, media- 
ted by vector mesons, for which the photon may serve as a model. 

Because of the peculiar combinations in which the creation and anni- 
hilation operators appear in the various terms in H’, each term will have 
nonvanishing matrix elements only between states which differ from 
each other appropriately in the occupation numbers. Thus a term 


(23.7) b(x) at(Is’) a. (kk) 
will give a contribution unequal zero only to the matrix element 
(23.8) <n) — 1, 14,(k’), 04.(Is)| A |n(o¢), 04.(Ik), 14.(k)>. 


The classification of such contributions is greatly aided by a graphical 
technique due to Feynman, which consists of the following conventions. 

Each term in H’ is represented by a vertex from which emerge or into 
which enter lines, one line emerging for each photon creation, electron 
creation, and positron annihilation operator, and one line entering for 
each photon annihilation, electron annihilation, and positron creation 
operator. Lines representing photons are drawn dotted, and lines repre- 
senting electrons or positrons are drawn solid, with the further conven- 
tion that all lines representing electrons are directed upwards and lines 
representing positrons are directed downwards. 

Thus the eight terms of H’ are represented by the following “Feynman 
Graphs”’ (letting now k stand for the labels k,r and x for x, 8): 


K K k 
ae Bi <e 
k k K 
(I) b(«) af (k’) a, (Ie) (TL) 6* (x) af (ke) a, (Ie) (TIT) 6(%) a (Ie) a_(k’) 
‘ A é 
spiel val va 
K a-€5=- ->--- 
(IV) b*(«)at(k)a_(k’) — (V) be) at (k’) at (k) (VI) b*(«)at(k’)at(k) 


(VII) b(%)a,(k)a_(k’) (VIII) 6*(%)a,(k)a_(k’) 
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One envisages the possibility of dissecting any actual interaction process 
into such elementary vertices representing virtual elementary acts of the 
following eight types: 


(I) Absorption of a photon under scattering of an electron 
(II) Emission of a photon under scattering of an electron 
(III) Absorption of a photon under scattering of a positron 
(LV) Emission of a photon under scattering of a positron 
(V) Absorption of a photon under creation of an electron-positron pair 
(VI) Emission of a photon under creation of an electron-positron pair 
(VIL) Absorption of a photon under annihilation of an electron-positron 
pair 
(VIII) Emission of a photon under annihilation of an electron-positron 
pair 
In accordance with the fundamental dynamical postulate developed 
in Section 7, any state vector |u(t)> = |b(t)>; describing a system of elec- 
tromagnetically interacting fermions in the interaction picture, which is 
appropriate here, will satisfy (7.10) 


(23.9) i(a/at)|u(t)> = Hy(t)|u(t)>. 


Comparison of the consequences of this equation with experiment is 
facilitated if one performs a formal integration leading to introduction 
of the concept of the scattering matrix. 

For a first orientation, this concept can be adumbrated by the following 
line of reasoning which might be useful as a mnemonic device, pending a 
more detailed treatment to be given later. In an infinitesimal time 
interval At one has 


(23.10) |u(t+ At)> = exp[—iH'(t) At] |u(t)> 
so that, by iteration 


(23,11) 
|u(t+24t)> = exp[—iH'(t+ Mt) At}exp[ —1H'(t) At] |u(t)>, ete. 


Now, since in general H’(t+ At) and H(t) do not commute, one must take 
into account 


(23.12) et eB 4 cAtB if AB#BA 
preventing one from writing down immediately 


[| expl—-iH'(t+ nt) At] = eso] faa, 
to 


n=0 
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However, one can arrive at a closed formula by introduction of a time- 
ordering operator P, so that 
(23.18) |u(t+2At)> = Plexp (—i{H'(t+ At) + H’(t)} At)]]u(t)> 
where 

_fHt)Hh) if >t 
(23.14) PLH(t)) H(t)] = { ee ee 


and generally 
(23.15) 
P[H(t;) H(tg)...H(t,)] = H(t) H(t))...H(t,) with t, >t >... > te 


One can thus write down, formally, the solution 


t 
(23.16) ju(t)> = pleso( if aoa |u(to)>, 
in 


in which P[ ] can be looked upon as a unitary operator connecting the 
state vector at time f) with the state vector at time ¢. Incidentally, time 
ordering after the prescription (23.15) is a relativistically invariant 
concept, provided P is applied to operators which commute for space- 
like points. 

When applying this solution to experimentally realizable situations 
one is normally interested in a comparison of the “final” state of 
the system |u(f= +)> with the “initial” state in the remote past 
|u(t = —«)>. With these limits one has then 


(23.17) Ju(t = +0) = S|u(t = —)) 
where the unitary scattering operator S is defined by 


(23.18) S = Peso( ; wat = Plexp(-i J Hx) d*2)| 


with 
(23.19) Hx) = —A,(x)j, (2) 


in the case of quantum electrodynamics. 
The matrix elements between some initial state, labeled by a complete 


set 7 of quantum numbers as |r) = |u(¢t = —)>, and some final state, 
labeled similarly |r") = |u(¢t = +0), turn out to be generally of the form 
(23,20) <1'|S|r> = 8, +<7'|R|7> 8(P’ —P) 8(P5 —Po) 


where <7’|R|r> (the “‘reaction matrix”) is some regular function of the 
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momenta p and the energies py of the particles involved in the transition. 
P and P, stand for the total momentum and the total energy of the 
system in state r, and the 6 functions express the conservation of these 
dynamical quantities. 

When computing transition probabilities between states +r’ #7, so 
that 5,,, = 0, one has to keep in mind when squaring (23.20) 


(23.21) [8(P’"—P)8(P)—Po)}? = [1/(2n)*]8(P’ —P)8(P)—Po) f d*a 


(see Appendix 6), so that one can meaningfully define only the transition 
probability per unit volume and unit time, 


(28.22) 
= lS? 
wr Tate 


By standard procedures one can introduce at this point the idea of the 
“cross section”’ for scattering into a given final state, essentially by 
dividing (23.22) by the number of incoming particles per unit time and 
unit area. Some authors have spent considerable effort on defining cross 
sections in a relativistically covariant manner. Although such definitions 
can be useful, especially if one wishes to make calculations in the bary- 
centric system, they are rather cumbersome and require complicated 
notations when more than two particles in the initial or final state are 
considered. Since in practice one deals usually with a laboratory frame 
of reference with one of the particles in the initial state at rest, and rarely 
considers situations with more than three particles in the final state, 
it is usually most convenient to make the so-called phase space con- 
siderations as one goes along in the description of a particular experi- 
mental situation. The standard example for phase space considerations 
in quantum electrodynamics is the well known Compton effect, which is 
treated for the purpose of illustrating the use of the 6 function in Appen- 
dix 6. Readers interested in other examples are referred to the work by 
Fermi quoted at the end of this section. 

Since the scattering matrix is the concept which most closely expresses 
the quantum mechanical view taken of processes involving transitions 
from an initial to a final state, it warrants a somewhat more detailed 
treatment than the derivation of expression (21.18) given above. 

Consider quite generally a physical object for which, by assumption, 
it makes sense to write the Hamiltonian 


(23.23) H = H°+H’ 


w, = [1/(2m)*]|<7'|R|7>|°8(P’ —P) 8(P5 — Po). 


where H’ represents the interaction energy of the system. Assume 
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further that H’, if written in the state picture, does not contain the time 
t explicitly. 

Following the development of Section 7, one has various possibilities 
of describing the object. 


(i) In the state picture the state of the system at time ¢ can be defined 
through observation at time ¢ and satisfies, if it is denoted by |d(t)>,, 
the Schroedinger equation 


(23.24) i(0|b),/8t) = Hb), 


Solutions of the Schroedinger equation in absence of H’ will be denoted 
|e(t)>, so that in the state picture 


(23.25) i(a|e>,/8t) = H|c),. 


(ii) In the operator picture the state is a fixed vector in time and may 
be taken as identical with the initial value of |b(t)>, in the state picture, 
|b>o = |b(0)>,. The transformation between state and operator picture 
is mediated by the unitary operations [see Eq. (7.17)] 


(23.26) —|b(t)>, =e" |b>o; elt), = exp(—tH"F) |c)o. 


(iii) In the interaction picture the concept of state has again a different 
meaning. One speaks of a “state at time ¢” in the interaction picture, 
denoted |b(t)>; = |w(t)>, and means a description obtained from observa- 
tions at time ¢ and then mentally reduced, so to speak, by interaction free 
motion to time t = 0. The connection of any state vector |w(t)> in the 
interaction picture with the corresponding state vector |b>) in the 
operator picture is thus 


(23.27) |b(t)>; = |w(t)> = exp (iH°t) e#* |b», 

i.e. the development of the state in time is determined essentially by H’. 
Introducing the notation 

(23.27') U(t) = e*exp(—iH°t); U*(t) = exp (iH°t) et 

one can write Eq. (23.27) 

(23.28) |u(t)> = U+(t) [bdo 

and has thus 

(23.29) |uu(ty)> = U*(t,) U(ty) Jults)> = Uta ty) falty)> 

with 

(23.30) U(ta, ty) = U*(te) U(ty). 
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This transformation operator U(t:, t;) satisfies the equation 


(23.31) t[OU (tg, ty)/Ot.] = [H'(te)]r U (te, t) 
where 
(23.32) [H'(t)], = exp (iH°t) H’ exp (—iH°t) 


is the interaction Hamiltonian in the interaction picture (the “effective” 
interaction) containing the time dependence as in the example (23.6) 
given above. To prove Eq. (23.31), note that, according to (23.27) and 
(23.26), the ‘‘state at time ¢” in the interaction picture is the “state” in 
the staté"picture reduced by interaction free motion to time t = 0, 
(23.33) |u(t)> = exp (iH°) [D(t)>, 

and use Hq. (23.24) so that by differentiation one obtains 

(23.34) 
i[0|u(t)>/At] 


ll 


—H° exp (iH°t) |b(t)>, +iexp (iH° t) [A|b(t)>,/0t] 
= exp (iH°t) H’|b(t)>, = exp (iH°t) H’ exp (—iH°t) |u(t)>. 
Equation (23.31) can be integrated formally, using the boundary 


condition U(t,t) = 1, to give 
te 


(23.35) U(tety) = 1-t J [H'(t)], U(t, ty) dt. 
& 
Writing 
(23.36) U(tyty) = Ut, ty) + UM(t, ty) + Ot, ty) +... 
one obtains by iteration, starting with U(t,t,), 
(23.37) 


ty ty t 
U (tests) = 1a f (A Wlrdt-+(—a? f CH Olat { LHC) ira’ +... 
t rs ty 
which can be summed to the time ordered product 


7 7 
(23.38) Utatyr= Pleso(-i f moat]. 
4 


From this the scattering operator S, connecting initial state 
|7> = |u(t = —2)> with final state |r’> = |u(t= +)>, is obtained as 
the limit 


(23.39) S = U(+@, —«). 


Although the interaction picture is particularly adapted to description 
of scattering processes, such processes can also be grasped in the state 
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picture as well as in the operator picture through introduction of the 
concepts of “final and initial configuration” and “outgoing and ingoing 
states.” 

Consider, for simplicity’s sake, a system which possesses no bound 
states, and in it are found only so-called single channel scattering 
processes, which means to each solution |b(t)>, of the Schroedinger 
equation (23.24) there should correspond wniquely a solution |c,(t)>, of 
(23.25) through the requirement 


(23.40) lim |b()>, = lim |c+(f)), 
to++o toto 


and another solution |c_(t)>, of (23.25) through the requirement 
(23.41) lim |b(t)>, = lim |c_(t)>,. 
t+-0 te-0 


These definitions of the ‘‘result of scattering” |c,> and the “cause of 
scattering” |c_> imply the notion of an interaction which is absent for 
large time ¢ > 0 and for large time ¢ < 0 and which is “‘turned on” only 
during the scattering process proper in between these times. 

One can now introduce the concept of “final and initial configuration,” 
denoted |c,(0)> and |c_(0)>, which are obtained, respectively, by 
calculating the result of scattering |c,(t)> for large t > 0 backward to 
t = 0, and the cause of scattering |c_(t)> for large t < 0 forward to t = 0, 
assuming interaction free motion in accordance with the second equation 
(23.26). Having done this, one is now in a position to relate the actual 
state |b(0)> at time t = 0 to the final configuration |c;(0)> by 


(23.42) |2(0)> = Q,|e,(0)> 

where 

(23.43) Q, = lim e#exp(—iH°t) 
t++o 


and to the initial configuration |c_(0)> by 


(28.44) |b(0)> = 2 fe-(0)> 

where 

(23.45) Q. = lim e#texp(—iH°t). 
t+-0 


In the operator picture one can thus characterize any scattering state 
|b» either by its initial or by its final configuration. Conversely, a state 
with final configuration |c,(0);7>, where 7 stands for the set of quantum 
numbers characterizing that state, can be denoted |b,(out)>) and called 
the “outgoing state,” and a state with initial configuration |c_(0);7> 
can be denoted |b,(in)>» and called the “ingoing state.” 
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An alternative scattering operator can now be defined in the operator 
picture, after a precedent by Yang and Feldman, as the projection 
operator which projects the outgoing states on the ingoing states, 


(23,46) |b,(in)> = Sy|b,(out)> for all r, 


so that the probability amplitude for finding the outgoing state 
|b,,(out)>, if the ingoing state is known to be |b,(in)>, is 


(23.47) <b,-(out) |b,(in)> = <b,-(out) |Sy|b,(out)>. 


The scattering operator Sy may be expressed in terms of the operators 
Q.,, because one may write (23.46) 


(23.48) Q_|c_(0);7> = Sy 24|c4(0);7> 
and since |c_(0);7> = |c,(0);7>, one has 
(23.49) Ay = QugQe 
where use has been made of the unitarity 
(23.50) OO: 
NOTES 


Feynman’s book [1] contains, inter alia, reprints of Feynman’s early 
papers in which the graph technique is developed. Fermi’s book [2] 
contains a number of examples in which the most ingenious use is made 
of phase space considerations. The scattering operator was invented by 
Wheeler [3]. The alternative definition of Sy is due to Yang and Feldman 
[4]. 

Readers interested in how to overcome difficulties encountered 
through presence of bound states, and in the case of multichannel 
scattering, are referred to the works by Ekstein [5] and Jauch [6]. See 
also the review article by Brenig and Haag [7]. 
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SECTION 24 


Perturbation Theory and the Propagator Concept 


Experimentally the coupling parameter e of quantum electrodynamics 
is found to have the value e? ~ 1/137, and, on the strength of this weakness 
in the coupling between charged fermions and photons, a so-called 
perturbation treatment of the interaction has been developed, consisting 
essentially of the hope that an expansion of the scattering operator S in 
the form 
(24.1) S=I+d J os J Pq (4, ++ 15%q) d42,...d4a, = 1+ D Sy 

nel nel 
is meaningful even though some of its terms turn out to result in diverging 
matrix elements. 

By comparison with (23.18), each of the terms in expansion (24.1) is 
obtained by the substitution 


(24.2) Sq = [(—i)" [nl] PL (ary)... HE") 
and the matrix elements to be computed are 
(24.3) 


<r! |Sql7> = (—ae/2)" (Unt) f ..- f oP), y4, Her) Ay (es) --- 
[B(2n)s Yan, P(%n)] A pun (%n)} [7 44. ..d* ay. 


The reduction of this—on first sight—overwhelmingly complicated 
expression to a form more tractable for actual computation can be done 
in a sequence of steps, which are the fruits of laborious efforts carried 
out by a number of workers in the late 1940’s and early 1950's, 

(i) Instead of using the “time ordered” product 


P[A(a,) A(x)... A(%p)] = A(x) A(a;)...A(a,) with t, >t >...>h 


of a set of n boson or fermion operators as defined in (23.15), it is more 
convenient to work with the ‘chronological product” denoted 
T[A(x) A(xz)...A(2%,)] which differs from P by a sign factor 7 

(24.4) T'[A(«y) A(z)... A(@n)] = 7A(a;) A(a;).. A(x) 


with (>t >...>h 
203 
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where 7 = +1 if the conversion of the sequence A(«,) A(x2)...A(w,) into 
A(x;)A(a;)...A(a,) requires an even number of transpositions of fermion 
operators, and 7 = —1 if this conversion involves an odd number of such 
transpositions. Since S, contains all fermion operators in pairs, any 
change in sequence y(ax;) y(2;) (as) ys(ar;) > fi(x;) yp(av;) (a;) (x;) involves an 
even number of transpositions of fermion operators, and one has there- 
fore simply 


(24.5) PH")... HC (ap)] = TLIO" (3). HE" (2p) ] 


(ii) All operators A,(x), (x) and yi(x) are separated into their 
“negative frequency part” and their ‘positive frequency part”: 


A, (x) = AD (2) +42) 
(24.6) (x) = Pa) +Yy(a) 
P(x) = FOr) + P(x) 


being essentially a separation into annihilation and creation operators, 
since % functions with phase e“ = e- have always been associated 
with annihilation operators and phases e~* = e~“-“) with creation 
operators. Thus, according to definitions (18.86) and (19.51) 


AD (x) = VV) >» [eu(S)/-V/ 20] e** b(x, 8) 
AP (x) = (VV) & D» [e&(S)/V 2a] eo b* (xe, 8) 
Pa) = >>» A,(k,r) ec a,(k, r) 


(24.7) F 
Ya) = YD A-(k,r)e“™ at(k, r) 
kr 


POR) = z > A_(k, r) e** a_(k, r) 
$x) = DY Ay(k, r) e™ at(k, r). 
kr 


(iii) One aims at reordering the operators making up S,, so that all 
creation operators stand to the left of all annihilation operators, and with 
this aim in mind one defines the “normal product” denoted N[...] of two 
boson operators A,(x) and Ao(z2) as 
(24.8) 

N{A4(@1) Ao(we)] = AYP (@) ASM(we) + A(M (#1) AP (we) 
+ AS (#2) ADP (a1) + AP (1) AP (#2) 


and of two fermion operators y1(a 1) and fo(22) as 
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(24.9) 
NUyfrr(1) ro(ae)] = YAP (wr) YS? (eo) + Pf? (ae) PP (we) — WS? (we) PP (wr) 
+P(w1) PP(w2). 
The definition of the normal product of any number of boson and/or 
fermion operators follows from these basic definitions by induction. The 
minus sign in front of the third term on the right-hand side of (24.9) has 
been introduced to accommodate the anti-C.R.s between fermion 


operators consistently. 
Introduction of the normal product is advantageous for three reasons: 


(a) The vacuum expectation value of any normal product vanishes, 


(24.10) <O|N[...]]0> = 0. 
(b) Since 
(24.11) N[Aj (#1) Ao(%2)] = N[As(x2) Ai(a1)] 
and 
(24,12) N [yx (271) oo(w2)] = —N[yo(w2) $1(e1)] 


one can treat any normal product N[...] as if all boson operators inside 
the bracket [...] always commute and as if all Fermion operators inside 
[...] always anticommute. 

(c) The operators in the interaction Hamiltonian #’(x) are already 
in ordered normal form, as is immediately obvious from (23.6) when one 
remembers that Boson operators and Fermion operators commute. One 
can thus write 


(24.13) H'(0) = —e NUP) y, Ay (a) (a). 


This is, of course, a consequence of the particular definition of the 
current density, which was arranged such that the vacuum expectation 
value of the current vanished. 

(iv) The conversion of a chronological product into a sum of normal 
products is possible, because the chronological product of two operators 
differs as a consequence of C.R.s or anti-C.R.s from the normal product 
of these two operators only by a number, which is called the “chrono- 
logical pairing.” Denoting this “chronological pairing” number asso- 


eps 
ciated with two boson operators A(#,) and Ao(xz) by A4(x1) Ao(x2), one 
has by definition 


(24,14) 
TA, (x) Ao(@2)] = N[Ay(®1) Ao(%2)] + Ai(®1) Ag(xo), 
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and denoting similarly the ‘‘chronological pairing’? number associated 
with two Fermion operators y(%,) and yo(%_) by y4(2%1)yo(%2) one has 
(24.15) 

Iya) bal)] = Myla) baled] + Halen) Pelee) 


Equation (24.10) allows one to compute any chronological pairing as a 
vacuum expectation value: 


(24.16) 1(%) A o(%2) = <0|T[A 4 (21) A(29)]|0> 
(24.17) Paley) alts) = 0/7 [p1(@1) Po(w2)]|0>. 


By successive use of prescriptions (24.14) and (24,15), one can now 
express any chronological product as a sum of terms containing only 
normal products and chronological pairings. More precisely, by induction 
one arrives at a theorem first proven by Wick: Any chronological product 
is equal to the sum of all possible normal products that can be formed with all 
possible pairings. The meaning of this statement is perhaps best communi- 
cated by writing out its application to the cases of two, three, and four 
operators: 


(24.18) 
T[AB] = N[AB]+N(AB] 
T[ABC] = N[ABC]+N{ABC]+N[ABC]+N{ABC] 
ABCD] = N[ABCD]+N{ABCD]+N[{ABCD]+ N[LABCD] 
loess! Fea] Fes 
+N[ABCD]+N[ABCD]+ N[ABCD] + N[ABOD] 


cnbbicmsndle 


The notation 
(24.19) 


esa, eat fal 
N[ABOD] = 7CDN[AB]; N[ABCD] = yADBC; ete. 


has been used, where 7 is again the sign factor equal to + 1 or — 1 depend- 
ing on whether an even or odd number of interchanges in fermion opera- 
tors are needed to convert the sequence of operators ABCD into the 
sequences CDAB, ADBC, etc., respectively. 
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The ground is now prepared for the conversion of the term S,, of the 
scattering operator (24.1) into a sum of terms which contain only normal 
products and chronological pairings. For > 3 this is still a rather for- 
midable task, but for n = 2 one obtains, upon observation of (24.5), 


(24.13), and Wick’s 


theorem, and keeping in mind that chronological 


mM el 
pairings between a boson and a fermion operator, as well as yx) and yf, 
vanish, a decomposition of 


(24,20) 


Se = —(€/2) f[ PENH) yp, Ay 1) HO) NP 2) ru, Ay (2) 
x (wp) ]}d4 x, d* xe 


into eight terms 


(24.21) 


Vit 
8, = 3 9 
C=I 


which will now be written down: 


(24.22) 
SP = — (67/2) [fH 


uD = —(€2/2) f f N 
Sup = —(e/2) ff N 
SiY) = —(e?/2) ff N 
89) = — (7/2) [J ¥ 


syyy 


—(e2/2) J J Ny 


sgvp —(e?/2) ff N 


SD — —(€2/2) spa 


#21) ¥p, Ap, (#1) (#1) P(@2) Yp, Ap,(@2) laa) ]d4 x, d 42x, 

B(x) Vp, Ays(1) H(@1) f(@2) Y¥p, Ay,(@2) p(%2)|d* a, d* x» 
aM eat) (een c= ae | 

[(@1) Yy, A,,,(%) (a1) P(a2) Yur Ay,(©2) p(x) d* a, d* xe 


| re aaa Se eameae | 
P(%1) Yu, Ap,(@1) Pla) Pare) yy, A,,,(2) (xa) d* x, d* xe 


Sr ee 

f(r) Yur A,,,(%) (x) f(x2) Vpur Apg(@2) (2) ] ds xd‘ xe 
aan See 

[y(x1) Yr Ap, (%1) #(%1) B(x) Vie A,,,(%2) (v2) ]d*x, d4 x. 


CL... Sd 
(i (1) Vien A,,,(%1) h(x) (x2) Vis A,,,(%2) (xo) |]d*ayd* axe 


ieee! 
[y(21) pr Ay,(1) H(%1) $ (2) Vue A,,,(%2) p(x) |d* x, d* xe 
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The conventions, explained in Section 23, leading to the Feynman 
graphs representing the various terms in the interaction Hamiltonian, 
allow a picturesque classification and extraordinarily suggestive inter- 
pretation of every term in the scattering matrix, after it has been ordered 
into normal products, if these conventions are extended to envelop 
chronological pairings, which are not operators, as follows. 


(a) Draw a vertex labeled a, for each integration variable x;. Since 
every such variable appears in conjunction with one and only one 
factor y,,, this convention is equivalent to drawing a vertex labeled 
for each factor y,. 


(b) Draw a solid line entering the vertex «, for each unpaired operator 
i4(x;), a solid line leaving 2; for each unpaired yf(«;), a dotted line without 
direction connected to 2; for each unpaired A(x;). These conventions are 
consistent with the ones adopted in Section 23, but are less detailed 
because they do not yet distinguish between ingoing electrons and out- 
going positrons, between outgoing electrons and ingoing positrons, and 
between ingoing and outgoing photons. 


(c) Since chronological pairings occur only as connections of one 
symbol ys and one symbol and never between two symbols % or two 
symbols J, and only between symbols 4 and % belonging to different 
vertices x; and x, the following conventions are sufficient and consistent. 
Draw a solid line connecting the vertices x; and x, in direction from 


x, to «; for each pairing y(x;)%h(a,), a solid line directed from x; to x, for 
| ag! 
each pairing #(x;)p(x,), and a dotted line without direction between 


- 
x; and x, for each pairing A(x,) A(2;). 
With these conventions all remaining operators %, , A in S, contri- 


en A ay Te 
bute only “external lines” and all pairings yb, yxb, AA contribute 
only ‘‘internal lines,” such that all fermion lines can be drawn by 
following arrows of direction from end to end without hiatus. For 
example, the eight terms (24.22) will correspond to the following eight 
graphs: 
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syn syn 


With the exception of S$? all graphs representing the various terms in 
S, have one or more internal lines. After some contemplation of 
these drawings one is inevitably tempted to adopt, with Feynman, a 
picturesque manner of speaking about them which conjures up for every 
internal line the image of a particle propagating in a state of virtual 
existence between the vertices of that line. Accordingly, the factors 


aa 
P(x;) yl) 


and 
Ay, (a; Ayal) 


will from now on be referred to as “fermion propagator” and ‘photon 
propagator,” respectively. 

One has to go back in the history of physics to Faraday’s concept of 
the field line if one wants to find a mnemonic device which matches 
Feynman’s graph in propagandistic persuasiveness. This historical 
analogy may serve here as timely warning against all too literal ac- 
ceptance of mental images based mainly on a fabric of conventions, 
however consistent that fabric may appear. Thus, Maxwell was led 
by all too literal acceptance of Faraday’s field concept to an ether theory 
of vacuum which ultimately turned out to be abortive. Similar tempta- 
tions are lurking behind Feynman’s graphs, especially the ones of the 
type Siv™D, 

Evaluation of matrix elements requires separation of all remaining 
operators into negative and positive frequency parts. Accordingly, one 
has a decomposition of S$? into 2° = 64 terms, of S$”, SY"), and S{V) 
each into 24 = 16 terms, of S$”, S$"), and SSY™ each into 2? = 4 terms, 
and S$¥1 requires no further decomposition. In the language of Feyn- 
man’s graph this amounts to sorting the external lines into electron lines, 
positron lines, ingoing photon lines, and outgoing photon lines. If one 
adopts again, as in Section 23, for external lines the convention of drawing 
electron lines pointing upward and positron lines pointing downward, 
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as well as drawing all external photon lines pointing upward, one arrives 


at a unique decomposition of each basic graph S$” into a sum of “directed 
graphs.’ One has thus, for example, 


(24.22) sgp = y Fit 


d=1 


i.e. 
; 
4 


4 A 

=- 1 + a + { + { 

ANA ee 

\ 4 oe 4 4 

+ + + } { + < 
pi int 

’ a I / 

A A 4 ; 

+ + + + 

ANG eT jo 


4 MN ee 


and a completely analogous decomposition of S{™. 


It is at this stage of the development that there emerges quite trans- 
parently the consistency of a manner of speaking about fermion lines, 
which refers to positrons as “electrons running backward in time,” by 
associating the upward direction in each directed graph with the direc- 
tion of time. This convention may be extended to fermion propagators, 


so that occurrence of a factor yi(a,)x(x;) is represented graphically as a 
virtual electron propagating from , to x; provided ¢, < ¢; and as a virtual 
positron propagating from «, to #; provided t, > t,. 

Thus one may think about a graph 
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which represents a contribution to the Compton effect on positrons as 
consisting of two virtual events occurring at x, and x» that are, for t, > t,, 
two virtual scattering processes with photon absorption at x, and photon 
emission at 2}, 


and, for ¢, < ¢,, virtual annihilation of an electron-positron pair at a, 
and virtual creation of a pair at x2, 


To 


t<ty 


lee I 


However, since an integration over 2, and 2, is performed in the compu- 
tation of the corresponding term of S$", the propagator will automatically 

‘take care of these possibilities, and one need not adhere to any up-down 
conventions as far as internal lines are concerned. 

The conventional nature of the phrase “‘ positrons are electrons running 
backward in time” should be abundantly clear by now, and the reader 
willnot be misled into concluding positron states are necessarily obtained 
by the operation of time reversal from electron states. 

From the expansion (24,22) for S{! and an identical expression for 
S$™ one can immediately read which terms in S, will contribute, for 
example, to the Compton effect on electrons. The only graphs containing 
one ingoing electron, one ingoing photon, one outgoing electron, and one 
outgoing photon are 
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i “rk » 
~ + + + 5 
Ay 4 Ses 
/ / A \ 
FEN 
‘ naa 
rs 4 
j i 


or, in analytical language, 


(24,23) 


(24,24) 


ira! 
SOR = <8 ff BPC) yp, ees) Wee) Yu, Hee) AGD (2) Af (0) 
+A (a1) AD (we)] d* ay d* xs. 


Similarly, one can read, for example, from the expansion for S{'Y), 


(24.25) se = 5 sera 
d=1 


the term which contributes to electron-electron scattering, namely the 
one represented by the sixth graph on the right-hand side of (24.25): 
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(24,26) 
SEES = (62/2) ff FP) 4p, 8 (0) HH) Yu, HU) 


arr 
3 A,,,(1) A,,,(®2) d‘ xd‘ x. 


One final task remains to be done before computation of matrix 
elements can be performed, namely evaluation of the propagators. 
According to (24.17) the fermion propagator is defined as 
(24.27) 
Olly) Plws)|0> fort, > te 
4(@1) P(w2) = (O|P Lp(a) F(x2)]]0> = | 
—<OlP(w2) W(ay)|0> for t < ty 
Upon decomposition of the operators ¢(a,) and if(a,) into positive and 
negative frequency parts, nonvanishing contributions to this vacuum 
expectation value are made by one term only in each case, namely 


(24.28) 
Offa) PP(ws)|0> for ty > te 


= OlP ay) P(a,)|0> for ty < te 
By substitution of the expansions (24.7) one obtains 


(24.29) 


a | 
(21) P(x) = { 


— z z A,(k,r) A4(k,r) exp [tk(x;—22)] for t; > ty 
W(x) P(e2) = z = 
-¥Y Y A_(k,r) A_(k,r)exp[—ik(a,;—a,)] for ty < ty 
i 
and summation over spins gives according to (19.52) and (19.53) 


(24.30)  ob(x1) B(xe) 


#3 i») exp [tk(71—a2)] 
K 


k + ; - 
= 2 anaes 169+) exp [—il(a, 4s) +i0(t —ty)], 


=> EM exp (ihe —a)] 
k 
= => Oat) exp [—ik(a,— 4s) —i2(h, —2)] 
k 


where @ is always meant to be equal to +/k®+m®, and where in the 
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second equation a change in summation label k > —k has been per- 
formed. 
This formula can be consolidated by utilizing the integral 


{Uae ay _ [ f(karertretint 


94 9 = = Sa oe 
(2631) | ame (ky—2)(k, +2) 
F 


F 


where k? = k?—k®, and { stands for integration along “‘Feynman’s 
P 


Fia. 24.1. Feynman’s contour. 


contour” in the complex k, plane, as indicated in Fig. 24.1. One has 
then to distinguish two cases. 


(a) t > 0. The contour can be completed in the upper half plane, so that 


(24.32a) 
I = 2niRes (ky = +) = 2milf(Q)/2Qle*me"™ — (¢ > 0). 


(b) t < 0. The contour can be completed in the lower half plane, so that 


(24.32b) 
I Ini Res (ky = —Q) = —2niff(—Q)/—2Q)e*e = (t < 0). 


Thus the propagator (24.30) can be written, under conversion of the sum 
¥,(.-./V) into the integral [1/(27)*]J...dk 


(24.33) 
1 k : 
Hey Meee) = saeye J ee uvesP ble —ae)}d*h = — Flea) We) 
F 


The computation of the photon propagator proceeds in complete 
analogy to the one of the fermion propagator. By substituting into 


(24.34) 

<0| A,,(%1) A,(v2) |0> = (0| Al) (a1) AY? (x2) |0> 
for t, > te 

<0| A, (a2) A, (21) |0> = <O| AD (#2) ALP (a) |0> 
for t, < ty 


aree. 
A,,(a1) Ay (#2) = 
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the expansions (24.7) one obtains 


(24,35) 

> eee vas ah ey (S. om [ine(aey —22)] 
Peeeral = 3 5; ir — (qi — Ge) + to(t; —te)], 
A,(%)A,(%2) = 


PSP ae a 16) 6A) ep inl tea) 
. S = 


The summation over the polarizations can be carried out in the gauge- 
invariant manner explained in Section 18 leading to (18.102), and 
formula (24.35) can be consolidated once again by exploiting the integral 
formula 


exp - te(qa — qe) — ten(ty —te)] 


(24.36) 
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Se ikatiot 
aie i eiket ihe eat) 
—il 
a ‘ | ky—w) (kg+o i oe ee 
; (kg ) (ha a 2 innit (f 10) 


yielding for the photon propagator finally 


(24.37) 

ary 1 1 kiuky Ky Ky 5 

Ayes) Aste) = sae [ <a{[8n— “ee ] +e4"HE*) exp fies —ay)ld 
) 


Since integration over any «; gives a 6 function, and since the computa- 
tion of matrix elements always involves integration over all vertex 
coordinates a;, one can formulate from the outset the rules for evaluation 
of Feynman’s graphs in momentum space, as summarized in Table 24.1. 


NOTES 


Since the advent of the work by Wick [1], the treatment of the 
scattering matrix by perturbation expansion has become rather stan- 
dardized and can be found in many texts with only small variations. 
Readers desiring to see this formalism applied to computation of cross 
sections are referred to Mandl [2], Kallén [3], Bogoliubov and Shirkov 
[4], Jauch and Robrlich [5], and Feynman [6]. 
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TABLE 24,1 
Frynman’s Rutes ror Evauuatine DrrecreD GRAPHS 


Sum over all indices appearing twice; integrate over the momenta of all internal lines 
along Feynman’s contour; sum over polarizations in internal lines. 


Graph Particle Direction Factor in matrix element 
i Electron Entering — [1/(27)3/2] A 4(k,7) 
aS Electron Leaving — [1/(27)3/2] 4 ,(k,r) 
h Positron Entering  [1/(2m)%/2]4_(k,r) 
4 Positron Leaving — [1/(2m)%/2] A_(k,r) 


7 Photon Entering [1/(27)*/J[e,(S)/V2w] (8 = 1,2 only) 


h Photon Leaving — [1/(27)9/2][e%(S)/V2w] — (S = 1,2 only) 


Vertex part iey,(2m)4 8(ka—kx— x) = IRs, ker + x, — x) 


e—<_. —_—Fermion propagating [1/i(Qm)4][b-+m]/[k2— m2] = G°(h) 


q 
Photon propagating —[1/i(27)4 21 (30 oo ‘) ft — = D(x) 


Ed 
¢=— =e 
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“SECTION 25 


The Hierarchy of Propagators 


The division of particles into “free” and ‘‘virtual”’ particles, corres- 
ponding to a separation of contributions to any Feynman graph into 
“external”? and ‘“‘internal’’ lines, is rather artificial and, from an 
operational point of view, in fact untenable. Since one cannot, in prin- 
ciple, “turn off” the interaction, it would appear more appropriate to 
view the world sub specie aeternitatis as an infinite concatenation of 
propagators, representable graphically as an infinite network of internal 
lines only. 

Consider, for example, an experimental situation in which one can 
meaningfully introduce an initial and a final state containing just one 
fermion. In the language of Feynman graphs one would represent this 
by an external fermion line entering into and emerging from a region of 
internal lines, as drawn in Fig. 25.1. However, before one can label an 
external line by some quantum numbers, the corresponding state must 
be prepared somehow, i.e. a measurement of the quantum numbers 
must be performed. Now any such measurement requires interaction of 
some kind, and in a pure quantum electrodynamics one is forced to infer 
the existence of two vertices, involving emission or absorption of photons, 
which are the terminals of the supposedly “external” lines, so that one 


Region of internal 
Hines 


Fic. 25.1. Hypothetical propagation of single fermion initial state into single fermion 
final state. 


should draw Fig. 25.1 more realistically as in Fig. 25.2, with the usual 

convention that t, >t, describes the propagation of an electron and 

tz <t, the propagation of a positron from x, to wy. Since any single 

fermion state can be considered as being raised from the vacuum state 
217 
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a 


Fia. 25.2. More realistic graph representing single fermion propagation. 


by the application of the appropriate creation operator, allowing one to 
write any matrix element of § between an initial state ‘‘fermion at a,” 
denoted |2;> = y(x;)|0> and a final state “fermion at x,” denoted 
<2x;| = (0|%(x;) as a vacuum expectation value 


(25.1) <a |S|x;> = <O| yp(aes) Sif (@;) [0, 


one is led to introduce the concept of the “true fermion propagator,” 
being the amplitude for fermion propagation between vertices x, and 
%2, denoted G(x2,a,) and defined by 


(25.2) G(ax2, 21) = (0 T[p(w2) S#(w1)] |0> 


as a chronologically ordered amplitude of the scattering matrix. The 
ordinary fermion propagator (24.27), now denoted @°(x»2,2;), is then 
obtained by omitting in the expansion of S all but the zero-order term, 
which is the identity, so that 


(25.3) Gay, 44) = <0 T[yp(w2) (ar1)] |O>. 
In momentum space the true propagator G(k) will be represented 


graphically as in Fig. 25.3, with a “region of ignorance,” hatched by 


k 


ik 


Fic. 25.3. Graph representing the true fermion propagator (region of ignorance hatched 
from SW to NE). 


convention from SW to NE for fermion propagation, which symbolizes 
all possible networks of internal lines which will contribute to G(x2,2,) 
and its momentum representation @(k) in accordance with the expansion 
of 8 in (25.2). The term “region of ignorance”’ for the region of internal 
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lines has been adopted, because no measurements beyond establishment 
of the value k at the terminal vertices are envisaged, and it should 
therefore be possible to find a measurement symbol, denoted M(k;k) 
after the precedent of Section 5, so that 


(25.4) G(k) = trace M(k; k) 
and 
(25.4') (ke) = [1/i(2m)*] (E+ m)/(k2—m)]. 


Similarly, the “true photon propagator” can be introduced as the 
chronologically ordered amplitude of the scattering matrix between 
single photon states, which may be represented in analogy to (25.2) as 
a vacuum expectation value, 


(25.5) D(a) %1) = 0|T[A, (#2) SA,(e)] |0> 


containing the ordinary propagator (24.42) as first term in the expansion 
of S, 


(25.5’) Diy (25%) = ¢0| T[A, (a2) A, (1) |9>. 


In momentum space the true photon propagator may be rendered 
graphically as in Fig, 25.4, the region of ignorance in this case being 


Fia, 25.4. Graph representing the true photon propagator (region of ignorance hatched 
from SE to NW). 


hatched by convention from SE to NW. One anticipates existence of a 
measurement symbol M,,,(«;«) associated with the momentum rep- 
resentation D,,,(«) of D,,(w2,21) so that 


(25.6) Dl) = trace M(x; «) 


and 


. 1 Ky, Ky\ OP, Ky 
(25.6') Do (x) = Taare 8 Be )+ 4 |: 
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Of particular interest for later development is further the “‘true vertex 
part’ ; rendered graphically in Fig. 25.5, with conventional crosshatching 


Li 


Fic. 25.5. Graph representing the true vertex part (region of ignorance crosshatched). 


of the region of ignorance in this case, Denoting its momentum represen- 
tation by I,(ky,k2,«), there should exist a measurement symbol 
M,(ky3k23«) so that 


(25.7) D,(h1, be, x) = trace M,,(ky; ka; «) 
and in lowest order 
(25.7') T9(key, kg, ke) = iey,(2m)*8(ky —ky— x). 


Finally, it is instructive to give special consideration to the true 
vacuum propagator, being the vacuum expectation value of the scatter- 
ing operator, rendered graphically in Fig. 25.6, as a pure region of 


Fic. 25.6. Graph representing the true vacuum propagator (region of ignorance 
hatched horizontally). 


ignorance, hatched horizontally by convention. The corresponding 
measurement symbol will be denoted M(0;0), and the vacuum propa- 
gator O(0) should be obtainable from it by 


(25.8) C(0) = trace M(0; 0). 


The actual construction of any measurement symbol can be carried 
out by noting that any matrix element of S can be written 


(25.9) ér'|S|z> = trace (S|r) <7’|). 


Thus the measurement symbol associated with the true vacuum propa- 
gator is simply 


(25.10) M(0;0) = 8|0> <0}. 
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Perturbation theory can now be viewed as an attempt to illuminate 
the spheres of ignorance associated with each true propagator by a series 
of successive approximations. Thus, up to and including the third order 
in the expansion of S, the true propagators could be rendered graphically 
as follows: 


(26.11) 


e-O:-@ 
~@-0-6-- 
o | 


i 


(25.13) © 


(25.14) 


ri 
A 


Inspection reveals that each graph always appears in conjunction 
with the same series of vacuum graphs. Assuming that these expansions 
make sense, one usually disregards all terms containing any disconnected 
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vacuum graphs on the grounds that doing so amounts simply to multi- 
plying all state vectors by an unobservable constant phase factor. Indeed, 
the eigenvalue C(0) of S in the vacuum state, 


(25.15) S|0> = C(0) |0> 
must have the modulus unity because of the unitarity of S, 
(25.16) |o|? =1 or C =e (a real) 


and in a series expansion C is numerically equal to the sum of all con- 
tributions to S with no unpaired operators. But this factor is the same 
for all matrix elements of S, because disconnectedness of the vacuum 
graphs is analytically equal to writing, for example 


(25.17) D + @- = P i) + (@)- 


so that omitting all vacuum graphs is equivalent to replacing S by 
(25.18) S38’ =C0"8 = e*8. 


At this point in the development one cannot avoid any longer facing 
up to a most perplexing difficulty which infects all perturbation theory: 
The second terms on the right in expansions (25,11)-(25.14) represent 
infinite contributions to the true propagators. Evaluation of these 
contributions with the rules laid down in Table 24.1 leads to a quad- 
ratically divergent integral for the second order vacuum graph, and to 
logarithmically divergent integrals for the second order fermion and 
photon propagator, and the third order vertex part. 

That these divergences appearing in the perturbation expansions of 
the scattering operator have not discredited this approach entirely and 
have not led to abolition of perturbation theory altogether is due to a 
number of facts among which the following looms most important. 
Employing the perturbation expansion and treating all infinite contri- 
butions as if they were finite (which formally can always be arranged by 
some invariant cutoff procedure), one can derive by an iteration pro- 
cedure a number of simultaneous equations satisfied by the thus evalu- 
ated true propagators, for which there is reasonable hope of being correct 
even though the expansion evaluation of the propagators turns out to be 
meaningless. 

The existence of integral equations linking in hierarchical fashion the 
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various spheres of ignorance in the presence of an interaction can be 
made plausible by the following elementary consideration. The basic 
interpretational postulate of quantum mechanics, as stated in Section 1, 
can be rewritten for the purpose of this section as follows: 

If P,, is the probability that if measurement of A gave the result a then 
measurement of B will give the result b, P,, the probability for finding 
the value c of C if Bis known to have the value 6, and P,, the probability 
for finding the value c of C if A is known to have the value a, then there 
exist complex numbers 


<alb>, <ble>, <alc> sothat P,», = |<alb>|?, P,. = |<b|o>|*, 
mi Poe = |<ale>|* 
an 


(25.19) <ale> = & <alb> <ble> 


if no attempt is made to measure B between measurements of A and C. 

The sum in (25.19) goes over all possible “channels,” that is all possible 
values of B through which the object may reach value c of C starting 
from value a of A. 

In the language of measurement symbols one can write this famous 
“addition of probability amplitudes” according to (6.2) and (6.3) 
(25.20) 

trace [M(c,a)] = ¥ <a|M(b)|c> = ¥ trace (M(c, a) M(b)] 
° ° 


= ¥ trace [M(b, a)] trace [M(c, b)]. 
b 


In the language of regions of ignorance this decomposition corresponds 
to a dissection of the original region of ignorance associated with M(c,a) 
into a network of other regions of ignorance, namely the ones associated 
with M(b,a) and M(c,b), thus 


(26.21) = 
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This equation is, of course, not sufficient to determine M(c,a), because 
all regions of ignorance associated with M(b,,a) and M(c,b,) can be 
dissected after the fashion of Eq. (25.21), and so on and so forth, corre- 
sponding to decomposing in (25.19) each probability amplitude in turn 
after the same recipe, thus <b|c> = 14<b|d> <d|c>, ete. 

Equation (25.19) and its graphical representation (25.21) encompasses, 
by iteration, an in principle infinite hierarchy of spheres of ignorance. By 
“hierarchy” is meant a simultaneous system of integral equations. 

Now, a remarkable feature of quantum electrodynamics is the possi- 
bility of writing down hierarchical equations for the regions of ignorance 
associated with the true propagators for fermions and photons in closed 
form, involving only these propagators themselves and the true vertex 
part. This possibility can be traced to the peculiar form of the interaction 
Hamiltonian, which amounts to the possibility of dissecting any inter- 
action process into elementary acts, represented graphically by vertices, 
in which single photons are either emitted or absorbed under observation 
of conservation of momentum and of lepton number. 

For the true fermion propagator one arrives by iteration at the hier- 
archical equation 


(25.22) 


which may be rendered analytically in momentum space as 
(25.22') 


G(ke) = Ge) + ie(2n)* G(ke) [ Lyle, kr, x) G (kee) Dye) yy d* G(R). 


Similarly, for the true photon propagator one arrives at the hierarchical 
equation 
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x 


(25.23) = 


which may be rendered analytically in momentum space as 


(25.23’) 
Dyy(te) = D9,,(e) + e(27)* Dyo(t) trace [ J T,(k—k, k, «) G(ke) G(x —k) 


xy d*k] D2, (1). 


Before one can attempt a solution of this set of equations, one requires 
a similar expression for the true vertex part. The sphere of ignorance 
associated with the true vertex part, however, has thus far eluded all efforts 
designed to write down a hierarchical equation in closed form. One must 
therefore state in all honesty 


(25.24) 


The question mark in Eq. (25.24) constitutes one of the important mathe- 
matical challenges posed by the existence of the electromagnetic inter- 
action. 

Some authors, in particular Landau and his co-workers, have ques- 
tioned the internal consistency of quantum electrodynamics on the basis 
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ofan approximation procedure, in which one substitutes for the question 
mark in (25.24) the expression 


(25.25) 


corresponding to adoption of an integral equation 

(25.25’) 

Ti(k,k—«, x) = 8(k,k—x,«) +f T,(k, k—«',x’) D,,(') G(k—x’) 
xD (k—n', k—k' —K, x) G(k—x' —k) 
xD(k—K' —K,k—-«, x’ )d*k’. 


The conclusions of these authors are open to doubt, because Eq. (25.25) 
cannot be a complete description of the true vertex part. It does not 
contain, for example, contributions to the true vertex part graph from 
decompositions of the type drawn in Fig. 25.7. 


Fra. 25.7. Type of graphs omitted from the expansion (25.25). 


In absence of any certain knowledge regarding the true propagators of 
quantum electrodynamics a concept known as “renormalization” has 
gained wide acceptance and been the object of a large body of learned 
literature. As to the meaning of renormalization, one finds in this liter- 
ature two quite different schools of thought. 
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(i) An examination of the perturbation expansions (25.12)-(25.14) 
reveals that the infinities in the true propagators appear only as factors 
Z,, Zo, Zs, and as an addition to the fermion mass, 5m, so that we may 
write 

G = 2.07% 
(25.26) Oy Fra iis 
Dy, = 23 Die 


with the understanding that G"*, 7?*, and Di? are regular functions, 
containing the fermion mass, if at all, in the combination 


(25.27) Moxy = mM+5m. 


The fermion propagator, in particular, can be shown to have in pertur- 
bation theory the form 


(25.28) @ = Z,|(k—mexy) 


which suggests the identification m,,, with the actually observed mass of 
the fermion. This procedure of making m+ 6m finite by decree is called 
“mass renormalization” and amounts to replacing the “bare” mass m 
by the “dressed” mass m,,, which contains through 6m all contributions 
to m,x, from the presence of interaction. 

Furthermore, since any factor e” in the perturbation expansion will 
always appear multiplied with the constants Z,, Z:, and Z, in the 
combination e*Z,Z7° Z3, one identifies 


(25.29) Cexp = €Z3" Zz! Ze 


with the experimentally determined charge of the fermion, a procedure 
called “charge renormalization” in this approach. 

Once the infinities of perturbation theory have been ‘‘taken care of” by 
this formal renormalization procedure, all contributions to the scattering 
matrix will be finite, and the calculation can be carried out, in principle, 
to any desired order in é%,,,. Renormalization does not, of course, guaran- 
tee that the remaining perturbation expansion in powers of ¢2,., converges. 

Investigations into the convergence properties of the renormalized 
perturbation theory are impeded by the always remaining laboriousness 
of perturbation calculations beyond the second order which is already 
quite cumbersome. In the few cases where such calculations have been 
carried outto the third and fourth order, and where comparison with obser- 
vation has been possible, the theoretical predictions have been confirmed 
with astonishing accuracy. 

The empirical success of this recipe for performing perturbation caleu- 
lations is remarkable because of the obvious meaninglessness of the 
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divergent constants Z,, Z:, Z3, and 8m, which appear as power series in 
the unrenormalized charge e® with divergent coefficients. By adhering to 
perturbation theory one abandons, in fact, all hope of obtaining true 
propagators free from infinities, and the renormalization procedure, in 
this form, must be recognized as a recipe which works for as yet unknown 
reasons. 

(ii) A completely different point of view regarding the meaning of 
“renormalization”’ has been advanced by a number of authors who 
suggest that the true propagators may not be uniquely determined by the 
hierarchical equations (25.22)-(25.24). The proponents of this approach 
assume, in fact, the existence of a set of true propagators @’, iis and Dee 
free from infinities and containing the coupling parameter e’, and propose 
that transition to a second set of true propagators G”, I, and Dj, free 
from infinities and containing the coupling parameter e”, will not result 
inany observable consequences, provided the two sets are related through 
the equations 


GQ” = 2,0! 

Megl 
(25.30) HAN belies 

Dis = 23 IDs. 


e"® = el®agl aban? 
where Z}, 22, 2g are arbitrary finite numbers. It must be stressed that the 
existence of such a “multiplicative renormalization group of trans- 
formations” cannot be proved by perturbation theory, because per- 
turbation theory excludes, from the outset, the existence of true pro- 
pagators free from infinities. The existence of relations (25.30) is at most 
suggested by perturbation theory, for if one replaces in (25.26) @ by z2G@, 
I, by ='I',, and D,,, by 23 D,,,, then this leads to a change in the effective 
value of the charge, meaning replacement of ¢2,.,, by 2x23 21-23. Therefore, 
if one simultaneously performs a compensating renormalization of charge 
e* +e? 2512425", then as a result of all these operations no change in e2,, 
is obtained. 

The very generality of this approach is its weakness. The invariance 
requirement expressed in Eqs. (25.30) is far too weak as a condition 
imposed on the unknown true propagators, and thus insufficient to lead 
to their unambiguous construction. 

Before one can extract information regarding the analytic properties of 
the true propagators from (25.30), one has to make some assumptions 
about the dependence of the propagators on their arguments. It is, for 
example, customary to assume that the true photon propagator should 
be representable in the form 
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d(x* Ky ky) d(x) KK 
(25.31) Dy (*) io (bu ea ‘) + a ) ee 
with unknown regular functions d(x”) and d,(x*). Such a form is suggested 
by the free photon propagator D!,,, corresponding to d(«*) = 1. It should 
be kept in mind, however, that a specific form such as (25.31) contains 
drastic assumptions regarding the analytic properties of D,,(«°), as 
exemplified by the existence of a pole at x” = 0. Similarly, the true fermion 


propagator is customarily assumed to have the form 


a(k?) k—b(k®) m 


(25.32) Ok) = 


with unknown functions a(k*) and b(k?), showing once again the per- 
suasiveness of perturbation theory, which contains the free fermion 
propagator as the lowest approximation in the form 


Once specific forms such as (25.31) and (25.32) have been adopted, 
Eggs. (25.30) become functional equations for the unknown functions d, 
a, and b, giving opportunity for the employment of elegant mathematical 
techniques. This approach will not be pursued further here, because one 
may seriously doubt the existence of any analytical resemblance between 
true and free propagators. There just does not exist at present any 
detailed certain information about true propagators that does not have 
its roots in perturbation theory. Even the functional equations for d, 
a, and } have never been solved without relapse into perturbation- 
theoretical arguments, which are always invoked when particular values 
of the true propagators are needed to specify, for example, the limiting 
behavior of d for a small coupling parameter e”. 

In recent years ingenious attempts have been made to break out of this 
circle of frustration circumscribed by perturbation theory by imposing 
very general analytical features on true propagators, allowing one to infer 
general relationships between the real and imaginary parts of the true 
propagator, which have become known as “dispersion relations.”’ For 
example, it has been conjectured that the true photon propagator may 
be decomposed into contributions from propagators for free particles 
with different masses. Thus d(x?) is written 


Axe? Lf p(M)aM 
(25.33) ol - ce [oe | 
0 
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with unknown spectral function p(J/*), allowing one to infer the disper- 
sion relation 


U'®) de’? 


Ke — Kee 


+0 
(25.34) Im [d(x2)] =: J Re 


-—« 


The specific spectral representation (25.33) contains the tacit assumption 
that the true propagator will always have a pole at x? = 0, corresponding 
to the pole exhibited by the free photon propagator, independent of the 
value of the coupling parameter. Some recent work by Schwinger (see end 
of Section 21) has thrown doubt, however, on the validity of this assump- 
tion, which seems so harmless and reasonable at first sight. 

Ina theory aimed at abandoning any reliance on perturbation theory, 
one cannot rely on starting with the assumed convergence of the iteration 
procedure, which in perturbation theory enables one to decompose any 
true propagators into three basic constituents, namely the fermion 
propagator G@, the photon propagator D,,, and the vertex part I’,. 
Accordingly, the main effort in the theory of dispersion relations has been 
directed at guessing immediately analytic properties of more complicated 
true propagators, for example, the propagator associated with the four- 
fermion vertex as drawn in Fig. 25.8, representing all fermion-fermion 


Fic. 25.8. Graph representing the four-fermion vertex. 


scattering processes, with a region of ignorance encompassing all possible 
channels which may contribute to the amplitude of this particular 
process. Without trying to disparage these heroic efforts, one should 
admit that all so-called dispersion relations put forward to date have the 
character of ingenious guesswork regarding the analytic properties of 
true propagators. 

The very existence of fermions and bosons with baffling numerical 
ratios of their masses would seem to indicate a much more intricate 
structure of even the simplest propagators than is commonly surmised. 
It should not surprise anyone if the enigma of the mass spectrum isin fact 
a self-consistency problem involving the entire hierarchy of propagators. 
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From this point of view any imminent breakthrough towards the solution 
of this problem seems improbable, and further arduous work should be 
anticipated. 


NOTES 


Integral equations connecting true propagators were first derived 
formally by Schwinger [1]. 

The concept of the true propagator is already contained in the work 
by Dyson [2] who initiated the current versions of renormalization theory. 
A comprehensive treatment is given in the book by Bogoliubov and 
Shirkov [3]. Examples of the point of view (i) regarding the meaning of 
renormalization are the papers by Killén [4] and Lehmann [5]. The 
point of view (ii) invoking the existence of a multiplicative renormali- 
zation group of transformations has been promulgated by Stueckelberg 
and Petermann [6], and Gell-Mann and Low [7]. 

The internal consistency of quantum electrodynamics has been ques- 
tioned in a series of papers by Landau et al. [8]. See also Landau et al. 
(9] and Landau [10]. 

Readers interested in dispersion relations are referred to the article by 
Mandelstam [11]. 
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SECTION 26 


On Selection Rules Due to Symmetry under Inversions 
and Rotations of Coordinates 


Despite the perplexities besetting the internal consistency of quan- 
tum electrodynamics, it is, judged by the criterion of being able to make 
quantitative predictions, a satisfactory theory, The strong and weak 
interactions other than electromagnetism have not yet been grasped 
with a tool of similar usefulness. The absence of any profound dynamical 
insights contrasts sharply with the firm knowledge already gained about 
some symmetry properties of these otherwise ununderstood interactions. 
The purpose of this and the following three sections is to exhibit in a few 
examples, chosen for their simplicity, how elementary symmetry con- 
siderations can have the power of leading to detailed predictions. Since 
here, more than in any other section, any attempt at completeness 
would explode the resolve to keep this work within bounds, all effort will 
be directed towards the modest didactic aim of whetting the reader’s 
appetite for more, thus providing him with a motivation for learning 
group theory, which is the language fitted to and almost indispensable 
for the systematic disentanglement of symmetry properties. 

The consequences of symmetry can be conveniently categorized 
according to the different types of transformation operations associated 
with the various symmetries. 


(I) Symmetries that can be formulated as transformation properties 
of state vectors under unitary operations in coordinate space. In this 
category belong conservation laws and selection rules generated by 
invariance under inversions and rotations of coordinates, which are the 
subject of this section. 

(II) Symmetry under permutation of identical objects, which can be 
represented by a unitary transformation in occupation number space. 
This symmetry will be the subject of Section 27. 

(III) Symmetries that must be formulated as transformation pro- 
perties of state vectors under antiunitary operations. In this category 
belong 

(a) Superselection rules, of which examples have already been 

mentioned in Sections 17 and 19. 
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(b) More subtle consequences of time-reversal invariance such as 
the principle of reciprocity and the principle of detailed balance. 

(c) Consequences of invariance under particle-antiparticle con- 
jugation. Details of iterms (b) and (c) are the subject of section 28, 
(IV) Symmetries in state vector spaces representing abstract intrinsic 

properties such as isospin. These will be treated in Section 29. 


A simple physical object which exhibits symmetries of categories 
(I) and (IJ) in a nontrivial fashion is the system of two photons. In parti- 
cular, one of the simplest systems whose parity can be observed is the 
two-photon system in a state of total momentum zero. 

As had been shown generally in Section 14, the operation of coordinate 
inversion can be represented by a unitary operator IT, and whenever the 
Hamiltonian of a physical system is invariant under JJ, conservation of 
parity ensues. The conservation of parity in strong and electromagnetic 
interactions can often be used to monitor intrinsic properties of unstable 
objects by observation of the parity of their decay products. 

A single photon state |x,S, defined as in (18.23), can never be a state 
of definite parity, because of the transformation properties 


(26.1) II |x, R> = |—x, Ly; II |x, L> = |—x, RY 


in the combined momentum-polarization space. More generally, any 
single photon state (18.13) or (18.14) should transform according to 


(26.2) T1,,2> = )14,>5  H|1,,1> = alla, n> 


where 7 is an as yet undetermined phase factor subject only to the 
condition 


(26.3) n=l 


imposed by the requirement that the operation IT? be equivalent to the 
identity. Consequently, the creation and annihilation operators should 
transform as 


TIb* (x, R) I~! = nbt(—x, L); TIb+(x, L) IT~* = nb*+(—x, R) 
TTb(x, R) IT~* = yb(—x, L); ITb(«, L) IT? = nb( —x, R) 
This phase factor 7 is sometimes given the unfortunate name the 
“intrinsic parity”’ of the single photon state. Its value depends on the 
transformation properties assigned to the operator (18.75) of the vector 
potential. With the usual convention of giving the magnetic field operator 


the transformation character of an axial vector, the vector potential 
must change sign under coordinate inversion, 


(26.5) ITA(q,t) -* = — A(—q,t). 


(26.4) 
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The physical meaning of the factor 7 is obscure. It would be quite wrong 
to think of a photon “‘carrying”’ this “intrinsic parity” as it travels 
with momentum x, because the concept of a rest frame for a massless 
particle is meaningless, and only in such a frame can an object have 
definite parity. Fortunately, the transformation properties of any two 
photon state do not depend on 7, since such a state is raised from vacuum 
by application of two creation operators, and therefore the operation JT 
will always generate only the factor 7? = 1. 

If the momentum x of one photon is given, one can construct four 
different two-photon states of momentum zero, namely 


|RR> = b*(x, R)b+(—x, R)|0> 
|LL) = b*(x, L) b*(—x, L)|0> 
| RLY = b+ (x, R)b*(—x, L)|0> 
|LR) = b*(x, L)b*(—x, R)|0> 


(26.6) 


|RR) |LLy 


|RLy |LRy 

Fia. 26.1, The four two-photon states of total momentum zero. 
which are rendered graphically in Fig. 26.1 and which transform under 
inversion of coordinates according to (26.4) as 
(26.7) 
TI|RR> = |LL); TI|LL> = |RR>; T|RLY = |RL); T|LR)> = |LR). 
The eigenstates of IT are therefore 
(26.8a) |RR>—|LL> with eigenvalue P= —1 
and 


(26.8b) 
|RR>+|LL); |RL>; |LR> with eigenvalue P = +1. 
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The states | RR) —|LL> and |RR> +|LL) can be distinguished experi- 
mentally because the planes of polarization of the photons show distinc- 
tive correlations in each case. To see this, decompose in accord with the 
precedent given in (18,69), circular polarizations into linear polarizations 
by writing 


b*(, R) = (1/2) [b*(, 1) +4b+ (x, 2)] 


(26.9) : 
b*(x, L) = (1/2) [b+(x, 1) —ib+(x, 2)] 
yielding 
(26.82) 
|RR>—|LL> = [b+ (x, R)b+(—x, R)—b*(%, L)b+(—x, L)]|0> 
= —i[b+(«, 2)b+(—x, 1) +b*(x, 1) b+(—x, 2)]|0> 

and 

(26.8b’) 


|RR>+|LL> = [b*(x, 1) b*(—x, 1) +b*(x, 2) b+(—x, 2)]|0>. 


This means the state |RR>—|LL) is created whenever the emission of a 
photon polarized along one axis is accompanied by the emission of a 
photon polarized along the orthogonal axis, i.e. the planes of polarization 
in this state are always perpendicular to each other. State |RR>+|LL), 
on the other hand, differs from the vacuum state by creation of two 
photons whose planes of polarization are always parallel. The states | RL) 
and |ZR) show no such correlations, because one has, for example, 


(26.8b”) 
|RL) = 4[b+(, 1) b+(—x, 1) +b+(e, 2) b+(—x, 2) + ib*(e, 1) b+(—x, 2) 
—ib*(x, 2) b+(—x, 1)]|0> 


so that the probability for finding the planes of polarization parallel or 
perpendicular to each other is exactly } in each case. 

When an object of total angular momentum zero decays into two 
photons, then observation of the correlation between the planes of 
polarization will reveal the intrinsic parity of that object, provided the 
decay is mediated by an interaction invariant under inversion of co- 
ordinates so that parity is conserved. Examples of such objects are the 
neutral pion 7° and positronium in the singlet ground state [see Eq. 
(27.22) below]. In both cases, observation has shown a correlation 
indicative of the state |RR>—|LL), revealing both the 7° and the 
positronium in the singlet ground state as pseudoscalar particles, i.e. 
having intrinsic parity P = —1. 
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The two-photon states of definite parity have interesting transfor- 
mation properties under rotations. Consider first a single photon state 
|x,S> and perform a rotation, denoted U ,, of angle 7 around an axis 
perpendicular to x. It is intuitively clear that in the thus rotated co- 
ordinate frame the photon will appear to be travelling in direction —x, 
but with the same handedness. Thus single photon states transform 
under U , according to 


(26.10) U, |x, R> = |—x, R); U, |x, L> = |-%,L». 
Accordingly, the two-photon states (26.6) will transform as 
U,|RR) = |RR); — U, | LL) = | LL) 
U,|RL) =|LR); 0, |LR) = |RL). 
The simultaneous eigenstates of IJ and U , are therefore 
(26.12a) |RR> —|LL> 


(26.11) 


(26.12b) |RR> +|LL> with eigenvalue U, = +1 
(26.12c) | RL) +|LRY 
(26.12d) | RL) —|LR> with eigenvalue U, = —1. 


The linear combinations (26.12c) and (26.12d) are no longer eigenstates 
of the operator of rotation U, around an axis parallel to x, which is not 
surprising because they are linear superpositions of states with different 
values of the component of angular momentum in direction x. 

From these transformation properties one can deduce a selection rule: 
A particle of spin 1 and of definite parity cannot decay into two photons. 

To prove this, one observes first of all that the spin of the particle can 
have, in the direction of », only the values m = +1,0,—1. Since the 
values of angular momentum in direction x available in two-photon 
states (26.6) are m = — 2,0, +2, it follows, by conservation of angular 
momentum, that only the state m = 0 can decay into two photons. This 
leaves as possible final states |RR>—|LL> and |RR>+|LL>. However, 
the state with m = 0 of the particle transforms as the spherical harmonic 
Y},0 = cos# (# is the angle with respect to x, see Appendix 1), 


(26.13) U,|m = 0> = —|m = 0) 


exhibiting the eigenvalue U , = —1, and thus removing the two remain- 
ing states (26.12a) and (26.12b) from the possible list of final states, 
because they belong to the eigenvalue U, = +1. 

Contrary to the case of massless particles, it should be possible to define 
the concept of intrinsic parity for single particles with mass unequal zero, 
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because such particles can be found in states of definite parity provided 
they are at rest. Of particular interest are the fermions of spin } whose 
7% functions satisfy Dirac’s equation. As had been shown in Section 19, 
the operation of coordinate inversion can be represented for these 
particles by 


(26.14) IT = npysHp 


where »p is a phase factor which may have one of the four possible values 

+ lor +i depending on whether the operation I7* of coordinate inversion 
applied twice has the value +1 or —1. In accordance with the discussion 
of Section 14 in connection with the double-valuedness of y functions 
for particles of spin }, the convention J7* = +1 is adopted here and only 
the case np = +1 will be considered. 

For a single electron or positron state at rest and in an eigenstate of 
II, with eigenvalue P p, denoted |e* > = |k = 0,r, Pp, L = + 1> where ris 
the spin label and L the lepton number, one can define the intrinsic 
parity P; by 
(26.15) I] |e*> = P;Pple*). 


If the electron or positron is characterized by the orbital angular momen- 
tum quantum number / one will have, in general, P) = (—1)' as had been 
shown in Section 14. Now from the representation of |e~> in spin-chirality 
space by the ¢ functions (see Table 19.2) 


(26.16) 
1 0 
0 ; 1 
A,(k = 0,r = 1) = const} | J; A,(k = 0,r = 2) = const 0 
0, 1 
it follows that, with the representation (19.5) for y4, 
(26.17) T1|e~> = npP ple. 


Thus, the phase factor np can be identified as the intrinsic parity of the 
electron, 
(26.18) P,(e-) = np. 
On the other hand, a positron state |e*> is represented in spin-chirality 
space by 
(26.19) 

1 


A_(k = 0,r = 1) = const a ; A_(k = 0,r = 2) = const 
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so that 

(26.20) I1|e+) = —npPplet> 

and one must conclude the intrinsic parity of the positron is 
(26.21) Piet) = —np. 


In other words : Although the intrinsic parity of a fermion satisfying Dirac’ s 
equation is not uniquely determined, particle and antiparticle have always 
opposite intrinsic parity in this case. 

Consequently, positronium which can be described by a product state 
|e~> x |e*> has intrinsic parity P; = — 1. The total parity of positronium 
in a state of definite parity is therefore 


(26.22) P(positronium) = —P»p. 


One has thus the selection rule: Positronium in a state with even 
1(Pp = + 1) cannot decay into an n-photon state with even parity, and 
positronium in a state with odd 1(Pp = —1) cannot decay into an n- 
photon state with odd parity. 

The optical transitions between different levels of positronium are not 
affected by the intrinsic parity, because only changes in Pp are observed 
in that case, and the selection rules following from conservation of parity 
and angular momentum are the same as in the corresponding hydrogen 
spectrum, 

The possibility of establishing unambiguously the relative parity of 
electron and positron P(e~e*) is intimately linked to the separation of 
electron and positron states by a superselection rule, which can in this 
case be associated with the conservation of lepton number, and which 
allows one [see Eqs. (15.52) and (15.53)]to speak meaningfully about the 
relative parity of the two states. It is not possible, on the other hand, to 
measure the relative parity of fermion states belonging to different 
particles in a given charge multiplet, such as neutron and proton states, 
because these are not separated by a superselection rule but rather by a 
conservation law such as the conservation of charge (which must be kept 
separate from conservation of lepton number or baryon number) 
following, presumably (see Section 20), from some kind of phase invari- 
ance, making the relative phases of the states belonging to a multiplet 
arbitrary. The relative parity P(pn) must therefore be fixed by conven- 
tion, and it is customary to assume that fermions belonging to the same 
multiplet have the same intrinsic parity, so that in particular one 
assumes P(pn) = +1. Similarly, the intrinsic parity of strange fermions, 
such as A, X, and 3, remains undetermined unless one fixes by con- 
vention the relative parity between nucleon N and one of them, P(N A) 
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say. The customary assumption is P(N A) = + 1. Once these two assump- 
tions have been made, the intrinsic parities of massive bosons, such as 
a and K, and of the baryons N, A, X, and & can, in principle, be deter- 
mined. 

At the time of writing, the knowledge of intrinsic parities of so-called 
elementary particles is incomplete. It is interesting to note, however, 
that all massive bosons, including the ‘“‘resonances”’ 7, w, p, ete., have 
odd intrinsic parity. All baryons, on the other hand, whose intrinsic 
parity is known at all, have even intrinsic parity. Whether this curious 
fact represents a general rule is unknown. 


NOTES 


Yang [1] first analyzed completely the symmetries of the two-photon 
state. The question of how one can observe in principle the correlation 
between the planes of polarization of the two photons emitted in the 
decay of the 7° meson was also answered by Yang [2]. 

For a detailed discussion of how to obtain, in principle, information 
about the relative intrinsic parity of baryons and massive bosons see 
the article by Sakurai [3]. 


REFERENCES 


[1] C. N. Yang, Phys. Rev. 77, 242 (1950). 

[2] C.N. Yang, Phys. Rev. 77, 722 (1950). 

[3] J. Sakurai, in ‘Brandeis Lectures in Theoretical Physics, 1961,” Vol. 1, 
p. 231. Benjamin, New York, 1962. 


SECTION 27 


Permutation Symmetry of Multiple Particle States 


The two-photon states defined as in (26.6) have the flaw of not 
exhibiting the permutation symmetry characteristic of multiple particle 
states. Consider generally a two-particle boson state 


(27.1) |lay1p.> = O*(71) b*(r2) |0> 


where 7 is a complete set of quantum numbers which can be used to 
characterize otherwise indistinguishable particles. ‘Indistinguish- 
ability” of particles means that it should not be possible to affix any 
other label to a particle beyond the set 7+. Expression (27.1), however, 
does not yet take care of this requirement, because the order in which 
the operators b+ have been written implies an additional labeling of the 
bosons, since the boson having quantum numbers 7, may, by a conven- 
tion which counts the order of operators from left to right, be called the 
“first”? boson and the one belonging to the set r2 the “second” boson. 
Physically, the state (27.1) should be indistinguishable from the state 


(27.2) Ley 1,,) = B*(72) b* (71) |0> 


in which, by the labeling convention employed above, the “first” boson 
has quantum numbers r, and the “second” has quantum numbers 7,. 
From this requirement of indistinguishability one can infer the existence 
of a unitary operator 7',, of transposition connecting the states so that 


(27.3) [11> = Trolle 1. = Li2b*(r1) 6*(r2) Tit T12|0> 
and thus 
(27.4) T2b*(71)b* (72) Tig = b* (72) b*(41) 


by the usual convention that the vacuum state is invariant, 7',2|0> = |0>. 
By an analogous argument, Eq. (27.3) will be found to exist for two 
fermion states as well, so that (27.4) will hold also if the operators b+ are 
replaced by the corresponding fermion creation operators at. 
Application of 7';, twice must, up to a phase factor which is set equal 
to unity by convention, restore the initial state, 


(27.5) Pilestso= |lavte >: 
241 
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The possible eigenvalues of 7';. must therefore be +1 and —1. The 
corresponding eigenstates will be called “symmetric” and “anti- 
symmetric” with respect to transposition and labeled |1,,,1,,>, and 
|1,,,1,,2« respectively, so that 


(27.6) Poller lors = +[lay lars 
(27.7) T9|1 = —|L 12a 


a Lay) 
From the C.R.s of the boson creation operators b+ and the anti-C.R.s of 
the fermion creation operators a* it follows immediately that two- 
particle boson states are symmetric and two-particle fermion states are 
antisymmetric under transposition. Indeed, for bosons Eq. (27.3) can be 
worked out to give with the help of the C.R.s (18.12) 


(27.8) 
Ty2\ 1s Lp = bt (r2)b4(71)|0> = +b*(r1)b*(72)|0> = +]1,51,>2 
and for fermions the anti-C.R.s (17.17) yield 


(27.9) 
T 21.) L.Dr = a*(r2) at(r3) |0> = —at(r,)a*(r2) |Oo> = —|L» 1D: 


In terms of single particle states this transformation property can be 
made manifest by writing the symmetric two-boson state 


(27.10) [Las Lede = (1/2) [|1p.>2 [Let [14.1 [12] 


where now the transposition 7',, operates on the subscript particle 
labels («= 1,2) as in | >,. Similarly, the two-fermion state can be 
written as an antisymmetrized combination of single particle states, 


(27.11) [Dep Leda = (1/V2)[]1,,91 [Lee [Le [1,92] 


Invariance of the Hamiltonian under transposition of particle labels 
means 


(27.12) T,,HTy = H or T,,H—HT,, = 0. 


It follows in this case that the eigenvalues of 7’. must be constants of 
the system in time. If, thus, at any given instant the state vector of a 
system is either symmetric or antisymmetric under transposition of 
particle labels, it will remain so for all times, unless an interaction 
energy is introduced which is not invariant under this transposition, 
i.e. which introduces a feature allowing the particles to be distinguished. 
Such apparent ‘“‘nonconservation of permutation symmetry” can, 
however, always be interpreted to mean that the set of quantum numbers 
considered is incomplete. Thus, if one transposes only the momentum 
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label, but not the spin label, of two fermions described completely by 
momentum and spin, one may find the state to be either symmetric or 
antisymmetric under transposition of momentum labels alone, but 
transposition of both momentum and spin labels must in this case lead to 
antisymmetry of the state vector. 

By straightforward generalization of the argument for two-particle 
states one finds any many-boson state |n,,,n,,,-..> to be symmetric 
under any permutation F of the particle labels, 


(27.13) Pl tings Wess oY = [Mays Mere ee 

and any many-fermion state |N,,,N,,,...> to be antisymmetric under 
that permutation, 

(27.14) PUN Norges = Op | Nin es s3)) 

where 

ena ele tome’ 


Anyone familiar with the learned language of group theory will thus 
recognize boson and fermion states as belonging to the only two existing 
one-dimensional representations of the permutation group, namely the 
completely symmetrical and completely antisymmetrical representation, 
respectively. This is a direct consequence of the empirical facts embodied 
in the C.R.s and anti-C.R.s, namely that the occupation number of 
boson quantum states is unlimited and that the occupation number of 
fermion quantum states satisfies the exclusion principle. 

Although there exist higher dimensional representations of the 
permutation group corresponding to different restrictions on the possible 
occupation numbers of quantum states, none of these appear to be 
realized in nature. The reasons for this peculiar empirical fact are not 
completely understood at present. 

As an example illustrating in a nontrivial fashion the concepts 
developed in this section consider once again the two-photon state, but 
use instead of the quantum numbers x, S (momentum, polarization) the 
quantum numbers w, j, m, P (energy, angular momentum, parity) (see 
Appendices 2 and 3) to characterize each single photon state contributing 
to the two-photon state. Any two-photon state is then a linear combina- 
tion of products |I,2m,>|s,m,>, where |l,m,> represents a possible orbital 
state of two photons and |s,m,> a possible spin state of the two photons. 
These linear combinations for given total angular momentum state 
|j,m> have to be chosen in accordance with the rules governing the 
addition of angular momenta, given in Appendix 2, where it is also shown 
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that for |s,|, |s)| = 1 there are nine spin states |s,m,> which transform 
under transposition of particle labels as 


(27.16a) T's2|8,m,> = +|8,m,> for s = 0,2 

(27.16b) T 12|8,m,> = —|8,m,> for: se = 1. 

The orbital state, on the other hand, transforms as 

(27.17) T 2|l,m> = (- 1) |l,m)>. 

This follows from the possibility of characterizing the orbital state 
completely by a y function ;,, (%: —%2) = ¥,,(%) depending only on the 
relative momentum x, —x, = x of the two photons, because in momen- 
tum space the Hamiltonian for the two photons can be written 

(27.18) 

H = +£(8;%,+82%2) = +[4(81 +82) (%1 +2) + $(S1 F Sz) (%1 —2)] 
which reduces to + 4(s; $ S)% in the center of mass frame. The operation 
of transposition of particle labels has thus the effect 
(27.19) P10 ty (%) = Pay —*) 


and is therefore, as far as the orbital motion is concerned, equivalent to 
the effect of the inversion of coordinates, 


(27.20) T2%,,(*) = Ih,,,(%)- 


Since the parity of any state of orbital angular momentum 1 is (—1)! 
(see Section 14), Eq. (27.17) follows, which may be written 


(27.21) T'y2|l,m> |s,m,> = (—1)'|L, m>Tr2|8,m,>. 
Since the complete state is required to be symmetric, 
(27.22) T 12|j,m> = +|j,m)> 


and the parity of the state is determined solely by its orbital part, one 
must conclude that to a two-photon state of even parity (i.e. 1 even) only 
symmetric spin states (i.e. s = 0 or s = 2) can contribute, whereas to a state 
of odd parity (i.e. 1 odd) only antisymmetric spin states (i.e. 8 = 1) can 
contribute. 

There is an interesting alternative classification of the polarization 
states of two photons. In Section 18 the polarization space of the photon, 
spanned by the eigenstates of S;, namely |R> and |Z, had been shown 
to be isomorphic to the space spanned by the eigenstates |4, +4> and 
|4, —4> of the angular momentum operator s, belonging to spin s = + 
[see Eqs. (18.57)—(18.60)], Consequently, the rules governing addition of 
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angular momenta j = } are applicable to the addition of photon polariza- 
tions. To bring out this analogy, the polarization states will be labeled 
with the quantum numbers $5, and 48, for each particle, so that one 
writes, in general, 


(27.23) |S> = |45o, 45> 
and, in particular, for single photons, 
(27.24) IR>=|2+bs  |L> = |, -b. 


The polarization states for two photons follow now from the rules of 
addition, laid down in Eq. (A2.9) of Appendix 2, giving rise to four 
polarization states, consisting of an antisymmetric singlet and a sym- 
metric triplet in polarization space, namely 


(27.25) 489 = 0: |0,0> = (1/72) [|R>1|Z>2—|LZ>1|R>e] 
|L, 1 > |R>, |R>2 
(27.26) $89 = 1: [1,0> = (1/972) [|R>1|Z>2+|L>1 [Re] 


|1,-1> = |Z)1|D>e 


Since the Hamiltonian for free photons commutes with the operator § 
of polarization, $8) and 4S, are “good’’ quantum numbers to describe 
any system of noninteracting photons. It is an intriguing and open 
question to what extent one can use rotational symmetry in polarization 
space, i.e. a “law of conservation of polarization,” to restrict possible 
choices of interaction Hamiltonians containing only photon creation and 
annihilation operators designed to account phenomenologically for 
possible photon-photon interactions. The question of whether emission 
or absorption of photons by other objects “conserves polarization” 
cannot be answered until one has defined the meaning of the polarization 
operator for these other objects. 

The operator of transposition 7,2, which had been introduced sym- 
bolically in (27.3), can actually be constructed in many cases if one 
specifies the variable on whose particle label 7’. should operate. One 
can, for example, represent the transposition of spin labels for 7 = 4 by 
the operator 


(27.27) Ty = H1+0(1)-0(2)] 


where o(1) and o(2) are the Pauli spin matrices operating on the spin 
state vectors |4, + 4,» of particle 1 and particle 2, respectively. Because 
of the relations 


(27.28) of =F; 0,0; = io, (cyclically), 
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the operator o(1)- (2) satisfies the quadratic equation 
(27.29) [o(1)-o(2)]? + 2[6(1)-6(2)]-3 = 0 


and therefore the eigenvalues of the operator o(1)-o(2), & (say), since 
they must also satisfy 


(27.30) & 42-3 = 0, 
are 
(27.31) éé+=41 and ££ = -3. 


The corresponding eigenvalues of 7',o, n (say), are thus 

at = Hl+ét) = +1 

a = K1+e) = -1 

and 7';, satisfies condition (27.5), 

(27.33) TR, = {1+ 2[o(1)-a(2)] + [o(1)-0(2)}?} = I. 
Relations (27.28) are also employed when one wishes to verify that 
Ty20(1) Td = o(2) 

T20(2) Tid = o(1). 


(27.32) 


(27.34) 


One finds, for example, 


(27.35) 
T9.04(1) = $[03(1) +04(2) —t09(1) o9(2) +%09(1) o9(2)] 


o1(2)T 12 = $[o4(2) +.04(1) + t09(1) o3(2) —t9(1) o2(2)] 
so that 
(27.36) Ty90,(1) = 04(2) Tie or Ty20,(1)Ti3 = 01(2) 


which verifies the first of the six relations (27.34). The eigenstates of 7’, 
are the antisymmetric singlet and the symmetric triplet (A2.9), 


Ts2|0,0> = —|0,0> 
Ty2|1,m,> = +|1,m,>;  _m, = +1,0, -1; 


as can again be seen by straightforward computation. For example, with 
the usual representations for o and |4, + >, one finds 


(27.38) Ss ee ee ee 
210,0> = 5[2+(; ae o),+(; a A 


(0 ao -1)dvalld),().-0),(0)J 


(27.37) 


27. Permutation Symmetry of Multiple Particle States 247 


raion = -2f0)(0)-0)()]- 90 


Expressions analogous to (27.27) can be found for the operator of 
transposition of spin labels for values j > 4. If one denotes the operators 
of angular momentum for particles 1 and 2 by J(1) and J(2), respectively, 
one requires, in general, for the representation of 7’. a polynomial in 
J(1)-J(2) of order 2j. This is so because the operator J(1)-J(2) satisfies, 
in general, an algebraic equation of order 2j+1. For j = 1, for example, 
one has the relations, readily obtained from representations (18.65) 


(27.39) Ji= Ji I JjJ, = 0; IP+dZ+S§ = 2; 
PI, =I Ip; Idi J, = il -J}); JJ} = ,J;  (ijkeeyel.) 
so that 

(27.40) [5(1)-5(2)]? + 2[5(1)-J(2)]?—[F(1)-J(2)]—2 = 0. 


The eigenvalues of J(1)-J(2) satisfy thus the cubic equation 


(27.41) é842¢7-€-2 = 0 

which has the three solutions 

(27.42) &=+1; 6=-1; &= -2 
The operator 

(27.48) Ty = [I(1)-d(2)]2 + [I(1)-I(2)J—1 


has then the eigenvalues 


+ = +1 corresponding to ; and 
(27.44) by) ip ig to fy &3 
y = —1 corresponding to £2 


and satisfies the condition 
(27.45) T?, = 1, 
The eigenstates of 7’';» in this case are the multiplets (A2.10), 
T'12|0,0> = +|0, 0) 
(27.48) Ty9|1,m,) = —|1,m,> m, = +1,0,-1 
T2|2,m,> = +|2,m,> m, = +2, +1,0, -1, -2. 


It is interesting to note that the operator J(1)-J(2) removes the 
degeneracy with respect to 7’, of the symmetric states |0,0> and |2,m,), 


248 Concepts in Quantum Mechanics 


they belong to the eigenvalues é, and é,, respectively, as follows from a 
simple computation : 


(27.47) J(1)-3(2)|0,0> = —2]0,0> = &,|0, 0> 
(27.48) I(1)-F(2)|2,m,> = +|2,m,> = €1|2,2,>. 


A representation can also be found for the operator of transposition 
of particle labels applied to a complete set of attributes. Such an operator 
would be required to have the property, for fermions, 


(27.49) Tf =Te;  Tyya(1) Ty? = a(2) 
so that also 
T,.a*(1) Ti = at(2) and T1.a(2) Ti = a(1) ete, 


Here a+(1) refers to the creation operator of a fermion, a+(1) = a*(r,), 
which is characterized by a complete set of quantum numbers 7;. The 
construction of the operator 7';, in terms of creation and annihilation 
operators for fermions is relatively easy, because there is only a finite 
number of bilinear combinations containing the operators a(1), a*(1), 
a(2), and a*(2) from which one can construct the most general unitary 
operator involving a(1), a+(1), a(2), and a+(2) (see Appendix 5), of which 
T 10, if it exists at all, must be a special case. One finds 


(27.50) Typ = —iel(nl2) Rus gi(ni2)Su, Td = +i enWrl2Su o-ilrl2) Ras 
with 
(27.51) Ryy = [a*(1)a(2) +a*(2)a(1)] 
(27.52) Siz = [1—a*(1)a(1)—a*(2)a(2)] 
will do the job required by (27.49). Indeed, since 
(27.53) (Siz,@(1)] = a(1); (S12, a(2)] = a(2) 
one has 
eflnl29S10 q(1) eHrI2S0 = (1) +H(/2) [S25 0(1)] + (i(or/2))® 1/21 
(27.54) x [S12,[Si2,a(1)]]+... 
= a(1)e%"!2) = ta(1), 
and similarly 
(27.55) ell7l2)Sis q(2) e~Wrl2)Sx = jq(2) 


so that, because 


(27.56) [Riz,a(1)] = —a(2); — [Riz,a(2)] = —a(1) 
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one has finally 


(27.57) 
TD yga(1) Tyg = tel?) Bu q(1) er) Rs — §fa(1) + (i(7/2)) [Ry2, a(1)] +.-.} 
t{a(1) cos (77/2) — ia(2) sin (77/2)} = a(2) 


and similarly 
(27.58) Ty2a(2)Ti¢ = ifa(2) cos (7/2) —ia(1) sin (7/2)} = a(1). 


The representation (27.50) for the transposition operator turns out to 
be equally valid for bosons, because the bilinear nature of the operators 
Ry and Sj» leads to the same C.R.s (27.53) and (27.56) if the fermion 
operators are replaced everywhere by the corresponding boson operators, 
Ty2 and 81» (say). The decisive difference between the two cases is that 
Ty» has with fermion operators only the eigenvalue —1 and with boson 
operators it has only the eigenvalue +1. 

For fermion operators S;. and Ry, satisfy the relations 
(27.59) 

Sigt+ Rig = 1; Ry2S1o = 0; Riz = Rye; Siz = Sis 


so that one can write 
(27.60) e7/2)Six — | +4849 8in (77/2) +S3, [cos (7/2) —1] = 1478 ,.—S?, 
‘ e722) Bx — 144R,.— R% 


and thus, for fermions, the operator of transposition can be put in the 
somewhat more transparent form 


(27.61) 
Te = Sig+Ryy = 1—a*(1)a(1)—a*(2) a(2) +a*(1) a(2) +a*(2)a(1). 
The factor +7 in (27.50) is now justified after the event, because 
(27.62) Ti. = Sig t+ RR, = 1 


requires this convention of phase. 
No such simplified representation seems to exist for bosons. 


NOTES 


The importance of permutation symmetry in the quantum mechani- 
cal description of multiple particle states was apparently first appreciated 
by Heisenberg [1]. A comprehensive treatment of the application of 
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permutation symmetry to the classification of multiple photon states is 
given in the book by Akhiezer and V. B. Berestetskii [2]. 

The explicit representation of the operator of transposition of spin 
labels for j = 4 is due to Dirac [3]. 
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SECTION 28 


Some Consequences of Symmetry under Particle Conju- 
gation and Time Reversal 


Whenever a particle differs from its antiparticle by an attribute, such 
as the fermion number F,, a state describing such a single particle or anti- 
particle, ie. a state with definite 7, can never be an eigenstate of the 
operator of particle conjugation I’, because by definition of the particle 
conjugation operation the operators J" and F' anticommute, 


(28.1) IF+FT = 0 


which is another way of saying that F changes sign under the operation 
I. For fermions of spin } satisfying Dirac’s equation there is an addi- 
tional complication arising from the antiunitary nature of I’, which had 
been established in Section 19. It is therefore not meaningful to introduce, 
in analogy to parity, the concept of “conjugality’’ meaning the eigen- 
value of I’, as long as only single particle states are considered. 
However, in the special case of a many-fermion system whose total 
fermion number vanishes, i.e, in a system consisting of an equal number 
of fermions and antifermions, the state vector can become an eigenstate 
of I’, [see the remark following Eq. (15.53)], 
(28.2) I|F = 0> = O|F = 0> 
and such states will be called “states of specific conjugality C.” The value 
of C depends in a rather intriguing way on the orbital angular momentum 
quantum number / and the spin quantum number s of the state, through 
the intervention of permutation symmetry. 
To bring this out consider as an example a positronium atom in the 
center of mass frame, so that its state vector can be described by 


(28.3) |positronium> = |k = k,—ke,s;,2,L; = +1,L, = —1) 


where k;, s;, L; refer to momentum, spin quantum number, and lepton 
number of the respective particle. Now the operation of particle conju- 
gation amounts in this case, by definition, to the transposition operation 
as far as the particle labels on L are concerned, 


(28.4) I =7,,(L). 
261 
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Transposition of particle labels on s;, on the other hand, has the eigen- 
value +1 for the triplet states s,-+s, =s = 1 and the eigenvalue —1 for 
the singlet state s, +8, = 8 = 0 (see Appendix 2, Eqs. (A2.9)]. 


(28.5) Py2(8) = (—1)"7. 


Finally, transposition of particle labels on k; is equivalent to the operation 
of coordinate inversion J7, and has in the center of mass frame the value 


(28.6) Ty2(k) = (—1). 


If the attributes k, s, and L forma hee set, then the operation of 
total transposition of particle labels 7',, is the product 


(28.7) Tyo = Tyo(k)Ty0(8) Tie(L) = (—1 YT. 


Since fermions of spin } satisfy the exclusion principle, 7';. must have the 
value — 1, and one can conclude 


(28.8) O(positronium) = (—1)'**. 


This conclusion is not affected by introduction of a second lepton 
number L,,, as had been done in Section 17, because electron and positron 
have the erat lepton number L, and therefore 7',, (L,,) = +1, so that 
(28.8) remains valid in this case. 

Tf one assumes that the electromagnetic interaction (23.4) is invariant 
under the operation of particle conjugation, so that 


(28.9) TA, j,07? = Ayjys 


one arrives at a number of selection rules which can be used to test this 
assumption experimentally. Although the concept of particle conjuga- 
tion was originally only defined for objects with distinct particles and 
antiparticles by insisting, in accordance with (19.66), that the electric 
current density operator change sign, 


(28.10) Tj,.t ="=j,, 
the invariance requirement (28.9) imposes a definite transformation 


property on photon states under particle conjugation, because the 
operator A, must now also transform as 


(28.11) rA,T = —A,. 


Now a single photon state |1,> is obtained, up to some factor, from the 
photon vacuum state |0,> by application of the operator A,,, which by 
(18.90) contains photon creation and annihilation operators linearly, 


(28.12) jl) = A,,|0,> 
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so that 
(28.13) I'|l,>) = PA,|0,) = FA, T|0,> = —A,TI0,. 


With the convention, implicit already in the treatment of positronium 
above, that the vacuum state is a state of even conjugality, one con- 
cludes a single photon state has odd conjugality, 


(28.18a) T\1,> = O|1,> with O(1 photon) = —1. 


By induction one has then immediately the result for the n-photon state 
[n,>, 
(28.14) C(n photons) = (—1)". 


Upon comparison with (28.8) one arrives at the selection rule: 

Positronium in a triplet state (s = 1) with odd 1 or in a singlet state 
(s = 0) with even J cannot decay into an odd number of photons, and 
positronium in a triplet state with even /or in a singlet state with odd / 
cannot decay into an even number of photons. 

It is suprisingly difficult to devise a crucial experiment which would 
test the conservation of C alone in positronium decay, because if in the 
decay parity and angular momentum are conserved one has additional 
selection rules which follow from (27.22) and the classification of photon 
states. For decay into two photons, for example, one can read from Table 
28.1, which has been constructed with the classification of two-photon 


TABLE 28.1 


SeLecrion RuEs ror Decay or Posirrontum into Two Pxorons 


Decay into two photons 
by conservation of: 


Positronium 
state J P=(-1)'*! 0=(-1)*" Co) P,J,J3 
1S 0 -1 +1 Allowed Allowed 
8Po 0 +1 +1 Allowed Allowed. 
IP, i +1 -1 Forbidden Forbidden 
881, 3D, 1 -1 -l Forbidden Forbidden 
SP, 1 +1 +1 Allowed Forbidden 
1Ds 2 -1 +1 Allowed Allowed 
8Po, Fg 2 +1 +1 Allowed Allowed. 
8De 2 -1 -1 Forbidden Allowed 
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states as given in Section 27 and Appendix 3, that the *D, state of posi- 
tronium is the lowest state from which one has, by observation of the 
absence or presence of two-photon decay, a crucial test of conservation 
or nonconservation of C in the interaction responsible for this decay. 
Relations (28.8) and (28.14) lead also to a selection rule in optical 
transitions between various positronium states under emission or absorp- 
tion of single photons. Denoting the spin and orbital quantum numbers 
of positronium in initial and final states by s,/ and s’,/’, one must have 


(28.15) (-1)#"# = -(-1)*" 


if C is to be conserved in the transition, This means, in particular, that 
quadrupole transitions (l/—/= +2,s’—s=0) are forbidden in posit- 
ronium, although they exist in the corresponding spectrum of hydrogen. 
Again, experimental verification of this prediction is difficult to obtain, 
because the electrical dipole transitions (l’—/ = + 1,s’—s = 0) between 
the corresponding energy levels are allowed. In this connection, it might 
be worth noting that these forbidden lines are not excluded if posit- 
ronium is placed into a given fixed external electric field, because then 
the interaction ceases to be symmetric in the two particles of opposite 
charge, so that the expectation value of I” is no longer conserved, or, to 
use a typical phrase, ‘‘C ceases to be a good quantum number.” 

The behavior of baryons and massive bosons under particle conjuga- 
tion is complicated by the existence of yet another attribute, the isospin, 
which will be treated in Section 29, and consequences of symmetry under 
particle conjugation for these objects will be taken up there. 

Turning now to the consequences of invariance under time reversal 0, 
one might expect on first sight selection rules similar to the ones generated 
by invariance under particle conjugation on account of the antiunitary 
nature of @ which had been established in Section 16. As in the case of 
particle conjugation symmetry, it is not meaningful to introduce, in 
analogy to parity, the concept of “reversality’”? meaning the eigenvalue 
of @, as long as single particle states are considered. The analog to the 
states of specific conjugality C' (28.2), i.e. “states of specific reversality 
T”’ (say), can exist only if there were an additive quantum number, 
L,, (say), which is odd under time reversal, 


(28.16) OL, +L, 0 = 0 


and if one could realize product states describing an equal number of 
particles with the attribute L, = +1 and particles with the attribute 


Ly, = —1,s0 that such states could have indeed the property 


(28.17) @lL, = 0> = TIL, = 0). 
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Unless the speculations mentioned at the end of Section 17, where the 
second lepton number L,, was assumed to have the property (28.16) 
making for example a state consisting of one ordinary neutrino v and 
one muon neutrino v, a candidate for having specific reversality accord- 
ing to (28.17), turn out to account faithfully for reality, no such situations 
seem to be realized in nature. 

Despite the absence of selection rules analogous to the ones following 
from conservation of C and P, invariance under time reversal has some 
far-reaching and experimentally accessible consequences whenever 
transitions are observed which require a definite sequence of states in 
time. 

Of immediate interest are situations, treated in Section 23, in which 
the amplitude for transition between a final state |r’) from an initial 
state |r> is given by the matrix element 


(28.18) <1'|8|r> 

of the scattering operator S defined as the limit 

(28.19) S = U(+, —«) 

of a unitary operator U(t:,t,) which satisfies 

(28.20) 

te =FRGhilei) Sin Tino See head 


where Hj,4(tz) is the interaction Hamiltonian in the interaction picture. 
The solution of (28.20) can always be put in the symbolical form [see 
Eq. (23.16)] 


ty 
(28.21) U(ts, ty) = Plexo (-« J Hida) 
4 


where P is the time-ordering operator. 

A general consequence of the invariance of Eq. (28.20) under time 
reversal is the principle of reciprocity: 

The amplitude of transition from an initial state |7> to a final state 
|7’> is, up to a phase factor + 1, equal to the amplitude for the “reverse” 
process, in which the sequence of initial and final states is reversed and 
each state |r> is replaced by its time reversed state |r), 


(28.22) <r'|S|7> = +<r9|S|72>. 


One must not confuse the “reverse” process |r > |rp> with the “inverse” 
process |r'>->|r> in which only the sequence of states is interchanged. 


256 Concepts in Quantum Mechanics 


For example, if 7 stands for the set of quantum numbers ky, kz, ..., 
84, Sz, -.. of momenta and spins of particles A, B, ... in the initial state, 
and 7’ stands for the set of quantum numbers kg, kp, ..., 8g, Sp, ... of 
particles C, D, ... (which are in general different from particles A, B, ...) 
in the final state, one has, since both k and s change sign under time 
reversal, 


the process |ka,kp,...,84,8p,-+.> > |Ko, Kp,...,89,8p,-+->s 
the inverse process |Ko, Kp, .. .,80;8ps ++.) > 
SS ee VL) and 
the reverse process |-—Ko, —Kp,.--, —8¢, —8p,-..) > 
|—ky, —ky,.-., -S4, —8p,--.>- 


The proof of the principle of reciprocity (28.22) is expedited if one 
exploits the possibility of representing the antiunitary property of time 
reversal @ by the operation of transposition in occupation number space. 
Thus, if with a convention of phase the vacuum state is assumed to 
transform as 


(28.23) @|0> = <0| 

then any occupation state, which is raised from vacuum by application 
of the appropriate number of creation operators 

(28.24) |7> = at(ry)...a*(t,) [0D 


will transform as 


(28.25) O|7> = €,€0| a (tm)r]---4L(71)2] = €n<rr| 


where e, is a phase factor + 1 which can be chosen arbitrarily for given 
number n of particles without violating the time reversal invariance of 
the C.R.s or anti-C.R.s valid between the operators a(7) and at(r). [The 
operators a, at in Eqs. (28.25) and (28.26) may be taken to represent 
fermions and/or bosons.] The effect of @ on all dynamical variables is 
taken care of by the substitution of the time reversed quantum numbers 
7p for the original quantum numbers 7. Similarly one has 


(28.26) <r'|O7? = (Ol a(r),)...a(71) O77 = €na*[(r1)p]---@* L(t )r] |O> 
€n|T7>- 


With this understanding about the representation of @ one can rewrite 
the transition amplitude (28.18) 


(28.27) <1'|S|7> = <r'|O-1 O80 O|7> = Em €n§T7|Sx|t7> 
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where Sy is the time reversed scattering operator 


(28.28) Sp = OSO-. 
To prove reciprocity one has therefore to prove that 
(28.29) Sp =8 


follows from the invariance of Eq. (28.20) under time reversal. 
To this end remember the unitary nature of U(t2,t,) = U*(t,,te), write 
down the adjoint of Eq. (28.20) and interchange the labels ¢, and f.: 


OU (te, t 
(28,20+) ey aie 1) = W(testy) Hells). 


Addition of Eqs. (28.20) and (28.20*) gives, for the special values t, =t 
and t; = —t 


(28.30) 21279 @ Ht) UG, 1) + UG, -1) Hin( —D) 
Time reversal invariance requires that if (28.30) is true, then 
.dU z(t, —t 
(28.307) 2% Ort 5 = Up(t, —t) [Hine (t) lr + Hint( —#) |r Ur(t, —#) 


must also be true, where Up = 9U@™", etc. The antiunitary nature of 
time reversal has been used consistently, requiring, for example, that 
upon application of © to the right hand side of (28,30) the order of the 
factors U and H, both of which contain creation and annihilation 
operators, be reversed. By the definition of time reversal 


(28.31) [Hine(t) ln = Hine( —t) 

and (28.30,) may be written as an equation for Up(t, —t) 

(28.32) pe = HAyyl(t) Up(t, —t) + Up(t, -t) Hin(—t) 

which is seen to be identical with Eq. (28.30) for U(t, —t). Therefore, 
(28.33) Tati) =O —8) 


and the observation that S is by (28,19) a special case of U(t, —t) completes 
the proof of Eq. (28.29) and thus of the principle of reciprocity. 

Under special circumstances the principle of reciprocity implies a 
simple relationship between the process and its inverse process, known 
as the principle of detailed balance: 


(28.34) |<x’|S|z>|* = |<z|S|z’>]?. 
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The validity of such a principle aids greatly the statistical analysis of 
states in thermal equilibrium and the approach to equilibrium, because 
it means transitions from a state r to a state r’ can be balanced directly 
by the inverse transitions from the state 7’ to state + without having to 
invoke any intermediate states through which such a balancing might be 
effected. It must be stressed, however, that the principle of detailed balance 
is not generally true. Even in classical mechanics it is not true. For 
example, if the transition refers to a collision between two particles inter- 
acting by a force which manifests itself as a nonspherical “shape” of the 
particles, detailed balancing is not valid, as was already pointed out by 
Boltzmann. 

Some of the special circumstances under which detailed balance does 
hold in quantum mechanics will now be recorded in a number of theorems. 


Theorem I. If the interaction mediating transitions is weak and if a 
perturbation expansion of S after the recipe (24.1) is meaningful, so that 
all observable effects can be accounted for by the first approximation 


(28.35) <7'|S|r> = <r'|Si|7> 
then the principle of detailed balance is valid. 


This follows immediately from the representation of S, as the time 
integral over H,,, and from the hermitean property of Hint, 


(28.36) <r'|Hint|t>* = <r |Hint|7’> 
so that 
(28.37) |<r’|Sa|7>[? = |<7]Sa7’>]?- 


Theorem II. If the process is invariant under inversion of coordinates IT, 
and if one measures only quantum numbers which change sign under 
both time reversal @ and coordinate inversion JZ, thea the principle of 
detailed balance is applicable. 


For example, consider a process between states characterized com- 
pletely by momenta k and spins s. The principle of reciprocity requires 
then 


(28.38) 

kG, 86... |S|ka,84,---> = +<—ky, —84,---|8|—Ko, —80,---- 
Invariance under inversion of coordinates means 
(28.39) 

<(-ky, —84,..-|8|—ko, —8¢,...> = (ka, -84,--- |S|Ko, —80,---> 
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so that 
(28.40) <ko,8¢,.--|S|K4,84,--.> = +(ky, —84,...|S|ko, —80,...>. 
If the spins are not measured in either transition, one has 


(28.41) 
D |<Ko,80,--- |S[kusa-->/? = D [Ks 84,--- |S|Ko, 80). ->? 
spins spins 
which is the principle of detailed balance applied to initial and final 
states in which only the momenta are known, 

A famous application of Theorem II is the determination of the spin of 
the 7+ meson, which is obtained by comparing the cross sections for the 
reaction p +p—>d+7* and its inverse 7+ +d-—>p +p which are measured 
for given momenta without observation of spins. The ratio of the two 
eross sections depends then essentially only on factors in phase space, 
which contain as only unknown the spin degeneracy (2s, +1) of pion 
states. 

Another pair of reactions for which the principle of detailed balance, 

as required by time reversal invariance and Theorem II, has been checked 
is the photodisintegration of He*, y+He?—H*+p and the radiative 
capture of protons by deuterium, p+H*®—>He*+y. 
Theorem III, If in a reaction involving two particles in the initial state 
and in the final state the spins of the particles lie in the reaction plane, 
then detailed balance holds, provided the interaction is invariant under 
rotations in space. This is true even if parity is not conserved in the 
reaction. 


The validity of this theorem is most easily established by the following 
geometrical reasoning. Draw a graph of the process, as indicated in Fig. 
28.1, with the understanding that none of the spins s 4, 8g, 8g, 8p have any 
components perpendicular to the plane formed by the momentum vectors 
k, and kg. By conservation of momentum this plane must be identical 
with the plane formed by kg and kp. Now draw the reverse process, as 
indicated in Fig. 28.2. If the process is invariant under rotations in space, 


ait ¢ Lak 


Fia. 28.1. The process. 
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Fic. 28.3. The inverse process. 


then the amplitude for the reverse process is equal to the amplitude for 
the process which is obtained by rotating all vectors in the reverse process 
around an axis perpendicular to the plane of reaction by an angle 7, as 
indicated in Fig. 28.3, and comparison with Fig. 28.1 shows that the result 
is the inverse process. This means the amplitudes for the process and its 
inverse are equal, and therefore the principle of detailed balance holds in 
this case. Since the invariance under inversions of coordinates has not 
been invoked, Theorem III will be true even if parity is not conserved in 
the reaction. 


Theorem IV. If in a reaction initial and final state are characterized only 
by the total angular momentum quantum numbers j, m and sets of other 
scalar quantum numbers 74, Tg, ... and 79, Tp, --. Which do not change 
sign under time reversal, then the principle of detailed balance holds 
provided the reaction is invariant under rotations in space. 


This follows from the observation that under the stated conditions 
only the quantum number m changes sign under time reversal, so that 
one has, on account of the principle of reciprocity, 


(28.42) <re, Tp, ++ 5G, m|S|745 7B, ++ Jr ™> 
= $474, TR 99s —m|S|re, Tys+ergy — My 


and, therefore, if S is invariant under rotations, the matrix element on 
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the right-hand side is equal to the one obtained by a rotation of co- 
ordinates which transforms m into —m but leaves the scalars 7, j in- 
variant, 


(28.48) <ro,Tp,-- 455; m|8|r4, 72, ABEEE Huis 


= +14, TR, ++ +J,™|S|r70, Tp, ++ 5J,M>- 


Since Eq. (28.43) connects the amplitude of the process with the ampli- 
tude of the inverse process, the principle of detailed balance holds in this 
case. 

It is rather unsettling to realize how stringent the conditions are under 
which detailed balance is a valid principle, in view of the great burden 
this principle has to carry in all proofs for the existence of stationary 
states in statistical physics. It seems almost impossible to realize 
experimentally systems devoid of any magnetic interaction properties, 
and since any magnetic interaction will in general remove the degeneracy 
of energy levels due to spins, the labeling of energy levels by some spin 
quantum number s becomes practically unavoidable. But under such 
circumstances the principle of detailed balance cannot be valid, and 
therefore all proofs which invoke detailed balance for existence of 
stationary states are suspect because their applicability to reality is in 
doubt. 

Nevertheless, in the absence of detailed balance, some kind of overall 
balance for transitions seems to be required if stationary states are to 
exist at all. The question of under which conditions overall balance 
generally follows from the principle of reciprocity for physically realiz- 
able systems poses one of the unsolved problems of statistical physics. 


NOTES 


Early treatments of the selection rules for positronium following from 
symmetry under particle conjugation are contained in papers by Wolfen- 
stein and Ravenhall [1], and Michel [2]. The conjugality property of the 
photon is already implicit in the work of Furry [3]. Explicitly it seems to 
have been stated first by Gell-Mann and Pais [4]. 

A lucid discussion of the principle of reciprocity and of conditions under 
which detailed balance holds is contained in the book by Williams [5]. 
Implications of time-reversal symmetry for strong interactions have been 
studied in detail by Henley and Jacobsohn [6]. 

The proof for the principle of reciprocity given here follows closely that 
of Mandl [7]. 
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Attributes Characteristic of Objects Engaging in Strong 
Interactions 


All baryons (NV, A, 2, 2, ...) and massive mesons (z, K, ...) partaking 
in the strong interactions possess a number of intrinsic attributes not 
shared, for example, by the leptons which engage only in weak and 
electromagnetic interactions. Most prominent among these attributes 
ranks the “isospin” T which derives its name and its status as a full- 
fledged dynamical variable from a curious symmetry property of strongly 
interacting objects in an abstract ‘‘isospace,”’ resembling formally the 
symmetry under rotations in three-dimensional coordinate space, and 
leading through this isomorphism with ordinary rotations to conser- 
vation of two new quantum numbers 7’ and 7’; in processes mediated by 
strong interactions, in analogy to the conservation of two angular 
momentum quantum numbers s and s; (orj and m) in processes mediated 
by interactions invariant under rotations in ordinary space. 

The “isospace’”’ was originally conceived by Heisenberg as a con- 
venient device to distinguish the neutron state and the proton state of 
the nucleon by a dichotomic attribute 73 (say) attached to the nucleon, 
which is given the value 7, = +1 if it is found to be a proton and the 
value 73 = — 1 if the nucleon is found to be a neutron. Once this abstrac- 
tion of considering proton and neutron as but two, and the only two 
possible, states of a generic entity called “nucleon” is accepted, the 
mathematical machinery set up in Sections 1 and 2 for the purpose of 
grasping the dichotomic attribute of fermion spin can be adapted to this 
new situation by changing nothing but the interpretation of symbols. 

Thus the state of a nucleon can be written in isospace as 


(29.1) Ix> = alpy+b|n> 


where |p> and |n> are pure proton and neutron states defined as the 
eigenstates 


(29.2) |p> = (a) |n> = (;) 
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of an “isospin” operator 
1 0 
(29.8) T3 = ts ‘| 


with eigenvalues +1 and —1, respectively, so that a and 6 in (29.1) can 
be looked upon as ys functions in the isospace spanned by the vectors 
(29.2) with the interpretation that 


|a|* is the probability for finding the nucleon as a proton and 
b|? is the probability for finding the nucleon as a neutron. 
P hf gs 


In this space the electric charge Q (in units of the electronic charge —e) 
carried by the nucleon can be represented by the operator 


(29.4) Q=Urnt+D; I= (0 1) 


which has |p) and |n) as eigenstates with eigenvalues +1 and 0, respec- 
tively. The analogs of the operators o, and o_ (see Appendix 1) con- 
structed from the spin matrices o, and o2 may therefore be called the 
“charge creation and annihilation” operators, 


(29.5) 


rentintid= (Js wean 9 


They are constructed from the isospin matrices 


Oba 0 -i 
(29.6) chs (; 4 : m= (; ‘) 
and have the properties 
n> = H t-|n> = 0 
(20.7) t4|n> = |p> |n> 
T+|p> = 03 t-|p> = |n). 
Also useful are the “projection” operators 
(29.8) 


pow () 3 mewn 9 


which project out of the general state |y> the proton and neutron compo- 
nents respectively, 


(29.9) t|lx> = ap); trlx> = b|n>. 
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In the terminology developed here, the state vector needed for the 
complete description of a single nucleon will then consist of a direct 
product of two vectors |¢> and |y>, 


(29.10) |nucleon> = |p x |x> 


where |@> accounts for all dynamical attributes of the nucleon such as 
momentum and spin, and |y) is specified by whatever quantum numbers 
are needed to completely label the nucleon in isospace. In view of the 
isomorphism of the isospin operators, 71, T2, and 73 with the Pauli spin 
operators 0, o2, and og one can hardly avoid trying at this stage to 
label the nucleon state in isospace by two quantum numbers 7’ = } 
and 7’, = +4in analogy to the two quantum numbers s = } ands, = +4 
needed to completely specify an ordinary fermion spin. Instead of saying 
“there are only two nucleons” it will be said from now on “the nucleon 
has isospin 7’ = }, with two possible orientations 7’, = +4 and 7's = —4 
in isospace.” Thus the states (29.2) will be labeled 


(29.11) 

Ip =|T=4;T,= +45 [my = |T =4;73 = -b, 
with the implication that addition of isospin quantum numbers for several 
nucleons shall be effected by applying the rules governing the addition of 


ordinary angular momenta, as given in Appendix 2. 
In particular, for an atomic nucleus containing A nucleons the total 


isospin will be represented by the operator T= (2,,7.,7's) with 


(29.12) tT, =4 3 7(N) 
N=1 

so that 

(29.13) T? = 72472472 


and the total charge of the nucleus will be the eigenvalue of the operator 
(29.14) 


A A A 
Z= % QN) = 4] ¥ vals S 1] = Po+(4/2y7. 


Nel 


Since T satisfies the C.R.s governing the components of angular momen- 
tum, the conclusions of Appendix 1 are applicable, and one concludes 
there must exist in isospace a state vector |7’,7',> satisfying 
(29.15) 

P27,T,> =T(T+1)|T,7;>; ,|T,7's> = Ts|T,T5) 
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so that any many-nucleon system can be characterized by two isospin 
quantum numbers 7',7’, if the single nucleon can be so classified. 

However, as soon as one, in this fashion, admits isospin as a legitimate 
attribute of nucleons, without which any set of attributes would be incomplete, 
most profound consequences ensue for the classification of many-nucleon 
states because of the intervention of the exclusion principle. 

To illustrate this consider a two-nucleon state which will, in general, be 
the direct product 


(29.16) |r(1)7'(2)> = Ja(1)s"(2)> x [IE T(2)> 


of a state vector characterized completely by the spatial quantum 
numbers s and s’ (such as momentum, spin, etc.) of nucleon 1 and 
nucleon 2, and a state vector in isospace characterized by the isospin 
quantum numbers T and T’ of the two nucleons. The exclusion principle 
requires that the operation of transposition of particle labels 7’, applied 
to the particle labels 1 and 2 of the complete set of quantum numbers r and 
7’ result in a change of sign of the state vector, 


(29.17) T|7(1)7'(2)> = —|7(1)7'(2)> 


and therefore a given isospin state |T(1)T’(2)> by its transformation 
property under 7';, in isospace will impose severe restrictions on the 
possible quantum numbers s and s’ of the two nucleons if (29.17) should 
be valid. 

More specifically, a two nucleon system may, in accordance with the 
rules governing the addition of two spins 4 [see Appendix 2, Eqs. (A2.9)], 
belong in isospace either to the antisymmetric singlet of total isospin 
P=s0; 


(29.18) \T = 0,75 = 0> = (1/V2) (|p>|n>—|n>|p>) 


or to the symmetric triplet of total isospin 7’ = 1, 


I 


|7 = 1,73 = +1) = |p>|p> 


I 


(29.19) |Z = 1,7, = 0> = (1/V'2)(|py|n>+|n>|p>) 
|? = 1,7, = —1l = |n>|n)> 


By working in the center of mass frame, denoting the coordinate state of 
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two nucleons symmetric under transposition of particle labels as |even> 
and the antisymmetric coordinate state as |odd>, and remembering that: 
the possible ordinary spin states of a system of two nucleons are either 
the antisymmetric singlet denoted |1) or the symmetric triplet denoted 
|3>, one can write down immediately the possible totally antisymmetric 
states for a system of two nucleons, namely 


(29.20a) Jeven> x |1> x |Z’ = 1> 
(29.20b) Jodd) x |3> x |Z = 1) 
(29.20ce) Jeven> x |3> x |Z’ = 0> 
(29.20d) Jodd) x |1> x |Z = 0> 


because the symmetric isospin triplet |Z’ = 1) must be combined with a 
state antisymmetric in the combined coordinate-spin space, and the 
antisymmetric isospin singlet |Z’ =0> must be combined with a state 
symmetric in coordinate-spin space to insure the validity of (29.17). 

The terminology of calling symmetrical coordinate states |even> and 
the antisymmetric ones |odd) stems from the observation, used already 
in preceding sections, that in the case of two particles the operation T',5 
of transposition of particle labels is identical with the operation of co- 
ordinate inversion J7 in the center of mass frame. If the relative intrinsic 
parity of the particles is taken to be even, then the coordinate state has 
the parity (—1)!, being the orbital angular momentum of the relative 
motion. It follows that the antisymmetric states must be states of odd 1 
and the symmetric states must be states of even /, 

As a consequence of the particular combinations (29.20) of coordinate- 
spin-isospin quantum numbers demanded by the exclusion principle, 
both the diproton and the dineutron, which by (29.19) necessarily belong 
to the isospin triplet, cannot exist in |even>|3> or |odd)|1> states with 
respect to spatial and spin variables. The deuteron, on the other hand, 
can exist in all four combinations of spatial and spin states. 

Experimentally, dineutron and diproton, which belong to 7’ = 1, are 
not found to exist in stable states, whereas the deuteron is found to be 
stable, but only if it is in an |even>|3) state, namely the *S-state which 
belongs to 7’ = 0. The lowest |even)|1) state of the deuteron, namely the 
1§-state belonging to 7’ = 1, is not stable. These facts indicate that the 
specific nuclear interactions, giving rise to stable states among nuclei, 
depend only on the absolute value of the isospin and do not distinguish 
between the 27'+1 multiplets. Moreover, a classification of the lowest 
energy levels of nuclei (be they stable states or resonant states of positive 
energy) suggests that the lowest possible isospin 7’ results in the strongest 
bond, as indicated in Table 29.1. 

Accordingly, in all theories of nuclear interactions which start from 
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the concept of an interaction Hamiltonian, the invariance of H,,, under 
rotations in isospace is assumed, being equivalent to assuming that in 
nuclear interactions isospin is conserved. Since changing the orientation 
in isospace means changing 7’;, which in turn means changing the total 
electric charge value of the nucleus, assuming the invariance under rota- 
tions in isospace amounts to assuming the charge independence of nuclear 
forces. It is now generally believed that the actual splitting observed 
between the energy levels belonging to a given isospin multiplet can, in 
principle, be accounted for by the remaining electromagnetic interaction 
between nucleons, which is by definition charge dependent, so that the 
electromagnetic interaction Hamiltonian depends on 7’, and is thus not 
invariant under rotations in isospace. This is the classic example of a 
symmetry, valid for a strong interaction, which is ‘‘broken”’ by a weaker 
interaction. In other words, conservation of isospin is not an absolute 
conservation law, it is valid only in reactions mediated by strong inter- 
actions, but is “violated” if electromagnetic interactions are taken into 
account. 

In recent years it has become experimentally possible to substitute in 
nuclei a A particle for a neutron, and the observation of thus obtained 
“hypernuclei’”’ has turned up another piece of evidence supporting the 
view that strong interactions, to which belongs the A—W interaction, 
conserve isospin, allowing the resulting energy levels to depend only on 
the total isospin 7', which is a scalar in isospace. Since the A particle has 
isospin 7’ = 0 (it comes only as an electrically neutral particle), its sub- 
stitution for a neutron will result in the formation of a nucleus with 
isospin 7’ lowered by }. Since the isotopic number A of the nucleus is 
not changed by the substitution, one should expect a multiplet structure 
of energy levels in hypernuclei similar to that of the nuclei, but shifted 
one up in the A scale, as indicated in Table 29.2. 

This table has been confirmed by observation of the hypernuclei 
Au?, ,He*, ,He®, and ,Li’ in multiplets as predicted. One should not 
expect numerical agreement with the binding energies of the corres- 
ponding nuclear levels, because the total number of particles A partaking 
in the interaction is different, and besides the A particle differs from the 
neutron by another intrinsic attribute, the “strangeness” to be taken 
up later in this section, which may have dynamical significance. Never- 
theless, it is interesting to note that the observed binding energy of 
about 2.6 Mev for the hypernucleus ,H® is very close to the binding 
energy of about 2.2 Mev for the deuteron H*, which correspond to each 
other in the two level schemes of Tables 29.1 and 29,2, 

It is perhaps instructive to set down here the most general interaction 
Hamiltonian for two nucleons which satisfies the requirement of charge 
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independence of nuclear forces, i.e. invariance under rotations in iso- 
space. Prior to imposing this symmetry, one would have as most general 
interaction Hamiltonian an expression of the form 


(29.21) Ain, = Hy,1 11+ Ai, O1 0+ Hy O14 +H, O,0; 


where Hy, 7, are the interaction operators acting on the spatial and spin 
variables of the two nucleons if they are in the isospin state |7’,7’;>, and 
Or, 7, stands for the projection operators which project out of the general 
isospin state of two nucleons, 


(29.22) Ix> = all, 1) +0]1, 0>+e]1, — 1) +d]0, o> 
the components belonging to quantum numbers 7’, 7',, respectively. 
These operators are explicitly 
O11 = (+ 2s(1)]8+75(2)] 
sothat ©, ,|x> = a1, 1 
O10 = $42, (1)7 (2) +2,(1) Po(2) —P'5(1) 15(2) 
(29.23) sothat 0,,/x> = b|1,0> 
O14 = 2-70) 4-22)] 
sothat ©, -4|x> = ell, -D 
Oo = 4-21) P,(2) —P(1) P2(2) —P5(1) P92) 
sothat 0p |x> = d|0,0>. 


The hypothesis of rotational symmetry in isospace means to put 


(29.24) Ay, — Hy, = Ay =H, (say) 
and 
(29.25) Ay,» = A, (say) 


where H, and H, are, in general, different. In fact, one knows from 
experiment that the H, having as lowest eigenstate the virtual 
Jeven>|1> state of the deuteron must give rise to a potential energy 
which is about 4 the potential energy due to H, which has as its ground 
state the stable |even>|3)> state of the deuteron. By substitution of 
(29.23)-(29.25) into (29.21) one finds as general isospin-conserving inter- 
action Hamiltonian for two nucleons 


(29.26) Ain, = Haf3+(T(1) T(2)]} +, (4- (8) T(2)} 
= [(83H,+H,)/4]+ (Ha—H,) [1(1) 1(2)] 


which is obviously a scalar in isospace. 
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It is interesting to see in detail how the “law of conservation of 
isospin” is “‘violated” if the electromagnetic interactions are added to 
the interaction Hamiltonian. Taking, as an example, the Coulomb inter- 
action which acts only between two protons, i.e. between two nucleons 
which are by (29.19) necessarily in the |7’ = 1,7’, = 1) state, one has to 
add to (29.26) the term 


(29.27) Hom = (€*/r) O11 = (/r) {E+ UP (1) +2 (2)]+7s(1)P3(2)} 
which is no longer invariant under rotations in isospace. Indeed, 
(29.28) Hoow2:-T1 Heo #9 and HoouT 2-12 Hoo # 0 

so that 7’ is no longer conserved. However, 7’, is still conserved, because 
(29.29) Hout s— P's Hoon = 9- 


This relation guarantees conservation of electric charge by virtue of the 
general definition (29.14), which reads here 


(29.30) T,=Z-1 


where Z is the charge number of the two-nucleon system. 

In the 1940’s there were lingering doubts in the minds of many 
physicists regarding the necessity of introducing the isospin as a genuine 
attribute. After all, one may conceivably arrive at a theory of nuclear 
forces by considering from the very beginning neutron and proton as 
different particles (namely differing in electric charge) and try to obtain 
the structure of the Hamiltonian (29.26) by assuming a peculiar co- 
ordinate and spin dependence of the specifically charge-independent 
nuclear interaction. This is logically possible if one postulates that the 
interaction between two nucleons in |even)|1> and |odd>|3> states is 
different from the interaction in |even)|3> and |odd>|1> states, but is 
otherwise charge independent. 

These doubts were largely dispelled, however, when in the early 1950's 
extension of the isospin formalism to pions and their interactions began 
to account elegantly for many features of the pion-nucleon interaction 
which would have been hard to explain in any other way. More influential 
than any other single piece of evidence in turning the tide in favor of 
isospin as an acceptable legitimate attribute probably was Brueckner’s 
explanation of a general feature governing the scatterings of pions by 
protons in 1952. Experimentally there had been established at that time 
a ratio o(7*):0(7~) = 3:1 for the respective total scattering cross section 
of positive and negative pions on protons up to 300 Mev incident pion 
energy in the laboratory frame of reference. Brueckner showed how this 
empirical relation can be understood simply as a consequence of the rules 
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governing addition of isospins, if one assumes conservation of isospin to 
hold in pion-nucleon interactions. The line of reasoning runs as follows. 

Since there are three pions, 7*, 7°, and 7~, one requires in isospace a 
vector with three components to describe the charge state of a pion, 
being a linear combination of the three basis vectors 


1 0 0 
(29.31) |7™ = (\): |7> = (\ |z> = ( ) 
0 0, -1 


denoted alternatively 
|? =1,7, = +1); [Pee Dg =O [i= 175 = —1>; 


and chosen with a convention of phase, which are the eigenstates of the 
operator 


oi 
(29.32) pp={0 0 0 


0 0 -1 
with eigenvalues +1, 0, and —1, respectively. Thus pg may be taken as 
the operator representing the charge Q of the pion. By analogy with the 
operators of angular momentum belonging to j = 1, the pion may be said 
to have isospin p= 1, the remaining two components of the isospin 
operator pe given by 


1 By edorD 1 0 -«t 0 
(29.338) pr=—=(1 0 1); pe=—=li 0 -#). 
V2\o 1 0 v2\0 ¢ 0, 
There are also, in analogy to (29.5), the charge creation and annihilation 


operators 
(29.34) 


1 (2 2 0 
p+ = 4(pi tips) = 1}; p- = 4(pi1—tpe) = = (1 0 0 
; i) ey Jal! 1 | 


having the properties 

(29.35) 
ple*> = 0; — ps|m> = (1/0/2)|7*>; paar) = (- 1/2) [2 
p-|7*> = (1/V2)|7°>;— p_|n°y = (—1/V2) |x; p_|z> = 0. 
In this notation the total isospin operator T ofa system of A nucleons 


and B pions will be represented by 
ie 4 B 
ial T= 2% +) +> e(b) 
i=l k=1 
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and the operator of the electric charge of this system is 
A B 
(29.37) Q = Py+(A/2) = 4d zalt) + & polk) +(A/?) 


It must then be possible, as in the case of a pure nucleon system, to 
characterize a system of A nucleons and B pions by two quantum 
numbers 7’, 7’, which fix the isospin state as in (29.15). Demanding 
“conservation of isospin” in strong interactions is then equivalent to 
requiring that the dynamical properties of a system consisting of pions 
and nucleons are dependent on the value of 7' only and do not depend on 
Ts. 
Consider as a simple example a system consisting of one pion and one 
nucleon. By the rules of addition of angular momenta (see Table A2.1) 
one has as possible states in isospace a quartet belonging to total isospin 
T =% and a doublet belonging to T = 3, 
(29.38) 

|f =4,75 = §> = | D [1D = |p lz 
BD = (VOEDILO+(V3) |b —DILD 

= (V3) [p> [7 +(V4) [n> |2*> 
8, 2 = (V9) 4, -D LO +(V9) E11 —D 
= (V¥) [n> [2% +(V3) [p> [> 
I -D = 1 -D1L -—D = ID |-> 
|B = (V9) [4D 11, 0-(V9) [2 —D LD 

= (V4) |p> |2>—(V4) [n> |2*> 
It, -—b = (VAILD IL —D-(V9) |b —D 1, 

= (V$)|p> |= >— (V4) [n> [> 
For comparison with the experimental evidence cited above the isospin 
properties of proton-pion states will now be examined. Inspection of the 
combinations (29.38) and (29.39) reveals that a |p>|7*> state is always 
a pure isospin state belonging to 7' = 3, whereas the states |p>|7°> and 
|p>|7-> are always isospin mixtures, 

|p>|7*> = |e 
(29.40) [p> |n°> = (V3) |B > + (V5) [8 
[p> |n> = (V9) [4 — D+ (V3) -D 


(29.39) 
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Tf one now supposes that isospin is conserved in the pion-nucleon 
interaction, then there must exist, for given total energy and angular 
momentum, two probability amplitudes a(}) and a($), depending on 7 
only, which govern, respectively, the scattering of an initial pion-nucleon 
state |7’ = 4) into a final pion-nucleon state |Z’ = }> and the scattering 
from an initial state |7’ = }> into a final state |7 = 4). In particular, a 
|p>|7*> initial state, which by (29.40) is necessarily a |7' = 4) state, can 
be seattered only into a final state |7' = $) with amplitude a(3), 


(29.41) |p>|7*> = |8,2> > a(8)|2,3> = a(8)|p> [7 

whereas a |p)|7~> initial state, which is a mixture of |Z’ =} and 
|T = => states, will be scattered into a final state to which contribute the 
original |7 = 4> component with amplitude a(4) and the |7' =%) com- 
ponent with amplitude a(§), 


(29.42) 
|p>lz> = (V3) [4 —D+(V9) |b, —D 
= (V9) a(8) |2, — > +(V9) a8) |, —b 
= (4) V2[a(2) —a(4)] |n> |m°> + (8) [a(2) + 2a(8)] | p> |. 
The content of Eq. (29.42) is summarized in Table 29.3. 


TABLE 29.3 


AMPLITUDES FoR Pron-Proton ScaTrEeRING PRocEssES 


Process Amplitude 
Ip>|2+> — |p>|2*+> a(3) 
\p>|2-> — |p>|a-> (B)a(8) + (8) a(d) 
Ip>|2-> — |n>|n°> (3)V 2a(#)— (4) V 2a(2) 


Suppose now the pion-nucleon interaction favors, for as yet unknown 
reasons, the state 7’ = $ in the region below 300 Mev incident energy, so 
that a($) > a(4). Then, since the cross section for a process is propor- 
tional to the square of the corresponding amplitude, one has the relation- 
ship 
(29.48) 

o(pr* — prt):o(pa~ —> pr-):o(pn~ > n°) = |a()|*:() |a(3)|*: 

(2) laa)? 


= 9:1:2 
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which, if one lumps the processes (pa~—>p7) and (pa —>n7°) into the 
total 7~ scattering cross section o(7~), implies the ratio 

(29.44) o(a*):o(77-) = 3:1 

in accordance with observation. If, on the other hand, the pion-nucleon 
interaction had the property a(4) = a(3), then this would have been 
exhibited as a ratio 1:1:0 instead of the observed 9:1:2 and a(4) > a($) 
would have resulted in a ratio 0:2:1, which is again excluded by experi- 
ment. 

The experimental evidence favors, thus, the notion that the pion- 
nucleon interaction conserves isospin and in the energy region below 
300 Mev takes place predominantly in the state |Z’ =>. Clearly, the 
analysis does not explain why the |7 => state is favored in this way. 
From the angular distribution of the scattered pions one can also infer a 
spin dependence of the interaction, favoring the Ps). state of the pion- 
nucleon system, again for as yet unknown reasons. The cross sections 
also show a pronounced maximum near 200 Mev incident energy. One 
lumps these curious experimental facts into the phrase “there is a 
(T =4%,Ps).) resonance in the pion-nucleon system at about 200 Mev.” 
Other evidence suggests that if 7 = § then in the P state the interaction 
is attractive, whereas in the S state the interaction is repulsive (this last 
fact is summarized in the phrase ‘‘the interaction has a hard core”’), for 
reasons which also remain obscure. In short, one badly needs a theory 
which explains the features of the strong interaction between pions and 
nucleons, which, one would hope, encompasses the nucleon-nucleon 
interaction through the pions as the “glue” mediating that interaction. 
Despite ingenious patch- and guesswork by many workers, no such 
theory seems to be within sight at the time of writing. 

In absence of a dynamical theory, exploitation of the symmetries in 
isospace exhibited by the strong interaction remains the most reliable 
tool for making predictions and giving “explanations” for observed 
“branching ratios.” Further examples of such ratios are obtained by 
comparing the reactions |n>|p>—>|7°>|d> and | p>|p>—|7*> |d> which 
require 7' = 1 by isospin conservation, so that 
(29.45) o(np > 1°d):o(pp > a*d) = 1:2 


and the reactions | p> |@>—>|7+> |H®> and | p> |d>—>|7°> |He*) resulting 
in 
(29.46) o(pd + 7+ H*):0(pd > 7° He’) = 2:1. 

Conservation of isospin may also result in strict selection rules. For 
example, the reaction 


(29.47) |d> |d> + |n°> [He*> 
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is strictly forbidden by conservation of T. If one ever were to observe 
this reaction one would have to conclude it is mediated by an interaction 
which does not conserve isospin, such as the electromagnetic interaction. 
For further details the reader is referred to the extensive literature on 
this subject. 

The relationship (29.37), among the operators of electric charge Q, 
the third component of isospin 7s, and the nucleon number A summarizes 
the curious fact that there is no negatively charged nucleon, making 
Q = } the center of charge, so to speak, for the nucleon, whereas the pion 
can exist in three charge states around Q = 0 as center of charge. This 
relation can be extended to include antinucleons, if one introduces the 
baryon number B which is given the value + 1 for the nucleon, — 1 forthe 
antinucleon, and 0 for the boson, because existence of two and only two 


antinucleons, the antineutron 7(Q = 0) and the antiproton p(Q = —1), 
ean be summarized by writing 
(29.48) Q-T1,—(B/2) = 0. 


This equation implies an assignment of isospin quantum numbers 7’,7'; 
to antinucleons, following the conventions employed for nucleons and 
pions, by associating a declining sequence of numbers 7’; with the declin- 
ing sequence of charge values Q. Thus the isospin states of antineutron 
and antiproton are labeled |x) = |7=4,7,;= +4 and |p>=|7 =}, 
T3= —}. 

In 1953 Gell-Mann and Nishijima showed how one can accommodate 
the curious displacements of the center of charge found empirically 
among the so-called ‘strange’? hyperons and kaons by introduction of 
yet another attribute, the “strangeness’’ S, defined as 


(29.49) S = 2(Q-7,—(B/2)] 


resulting in the now famous classification scheme of baryons and massive 
bosons laid out in Table 29.4. In this scheme the assignment of baryon 
number +1 to baryons, —1 to antibaryons, and 0 to bosons is assumed. 

An alternative, but equivalent, scheme which has gained some popu- 
larity in recent years is obtained by introducing instead of the strangeness 
S the “hypercharge”’ Y defined as 


(29.50) Y =S+B 


resulting in a possible classification of particles according to isospin T, 
hypercharge Y, and baryon number B. For the discussion of conservation 
laws it is, however, most convenient to adhere to the strangeness concept 
as defined in Eq. (29.49). 
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Since there is quite overwhelming evidence for the separate conserva- 
tion of electric charge Q and baryon number B in all interactions, con- 
servation of isospin in strong interactions will result in conservation of 
strangeness. For reasons as yet completely unknown, experiments reveal 
conservation of strangeness (and thus of 7's) in all processes mediated by 
strong interactions, and nonconservation of strangeness in weak inter- 
actions. Further, there is some experimental evidence indicating, again 
for unknown reasons, charge independence (meaning conservation of 
both T and 1’,) in all strong interactions involving baryons and massive 


TABLE 29.4. 


Tue Scueme or GELL-MANN AND NISHIJIMA 


. Wall 2 =a 0 +1 +2 
Ts \ 
0 0 Ae 70 
' 
+ =e Ke pe et 5 
$ i 
=t &- ie n | @p Ko Fo 
t 
1 zt at Pd 
1 Px) 7? Zo 
= = 7 be 


bosons. In addition, there is some evidence for existence of two selection 
rules, namely |47's| = 1 and |47'| = }, governing the isospin-noncon- 
serving weak interactions, whose origin is also obscure. 

Even supposing one can understand conservation of charge @ as a 
consequence of a gauge-invariance principle, as indicated in Sections 20 
and 21, there remains for discussion the empirically very well-established 
conservation of baryon number B, and raises once again the question of 
whether this conservation law is not in fact a superselection rule, 
generated by invariance under an antiunitary symmetry operation, such 
as combined inversion S or time reversal @, which have already been 
recognized in Section 17 as possible origins for the analogous conser- 
vation of lepton numbers. 

To settle this question, an examination of the transformation proper- 
ties of isospin T and baryon number B under the operations of coordinate 
inversion IJ, time reversal and, in particular, under particle conjugation 
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T'is required, because once isospin and baryon number have been ad- 
mitted as legitimate quantum numbers, the complete representation of 
each operator will consist of a direct product of the representations in the 
various subspaces needed to accommodate the complete set of quantum 
numbers. For example, the operator of particle conjugation for a baryon 
state will consist of the direct product 


(29.51) Dyaryons = Up x Iso x Tyg x I’ 


where I", and I's¢ refer, respectively, to the representations in momentum 
space and in spin-chirality space already given in Section 19, and I’;y 
and I’; refer to the representations in isospace and baryon number space 
which will now be established. 

To carry as far as possible the analogy with the corresponding treat- 
ment for leptons given in Section 28, both nucleon and antinucleon states 
will be represented as four-component vectors in the product space 
spanned by the simultaneous eigenstates of 7'; and B. Thus, if one 
introduces with a convention of phase as basis the four states 


(29.52) 


1 0 0 0 
0 1 2 0 <, 0 
I={of Im=[o)) Im=(ah D=] 4g 
0 0 0 =i) 


one has for isospin, baryon number, and electric charge in this combined 
isospin—baryon-number space the representations 


(29.53) 
0100 
1/1 0 0 0 
Pr=imxl=515 9 9 1 |s Pea dexls 
Om On ate 0 
f,=4r3xl;  T? = 3x1 
Ltt a as 
reo il 
(29.54) B= Ix(, aah Geipmgt? vig 
; co Oe 0 
1y50)5,010 a0 
end O's OF tO 
(29.55) Q@=T.+(B)2)=(5 9 9 0 
Ot @ an 
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It is readily seen that T, B, and Q all commute, as required if 1, T?, B, 
and @ are to be diagonal simultaneously. 
Now the operator of particle conjugation I should have, by definition, 
the properties 
(29.56) : 
Tp) =|; Ti =|); Ti =|; TP = |p» 
implying the space spanned by (29.52) the representation 


Om Oat) coat 
On Oe leene 

(29.57) Dail peaeoe  eauaenis: 
10 200 


As in the corresponding case of a system composed of leptons and anti- 
leptons, it is not possible for a system composed of nucleons and anti- 
nucleons with eigenvalues B 4 0 and/or Q 4 0 to be in a state of specific 
conjugality, because J” anticommutes with both B and Q, 


(29.58) TB+Br=0; TQ+@Qr=0. 


If, on the other hand, B = 0 and Q = 0, for example in an object com- 
posed of a proton and antiproton, the system may be in an eigenstate of 
I. This is true even though I’ may not be representable as a unitary 
operator, for the general reasons stated in Section 15 following Eq. 
(15.53). However, the conjugality properties of baryons are set apart 
from the conjugality properties of leptons by a complication arising from 
the existence of isospin as an attribute needed to characterize baryons. 

At this point in the development it becomes essential to distinguish 
between the possibilities of representing I’ either as a unitary or as an 
antiunitary operator. 

On the assumption that I can be represented by a unitary operator, Lee 
and Yang in 1956 first analyzed the particle-antiparticle symmetry of 
objects possessing isospin by a line of reasoning which will now be re- 
traced here. The conclusions reached by these authors should, however, 
be approached with caution, until the consequences of the alternative 
representation of J” by an antiunitary operator have been scrutinized, as 
will be done later in this section. If the unitary representation of I in 
isospin-baryon-number space is denoted I’, then the C.R.s follow from 
the representations (29.57) and (29,53) (assuming I’y not to contain 
an operation of complex conjugation as would be necessary if were to be 
antiunitary) : 


(29.59) 
Iy?,+2, Tp = 0; I'ygf,-T.Ty = 0; TyPst+P sly = 0. 
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Therefore, a system composed of baryons and antibaryons can never be 
in a simultaneous eigenstate of I, and 7’, 7's. For example, the eigen- 
states of 7’, T's for a nucleon-antinucleon pair are (note the phase con- 
vention implied by (29.52)) 


[1,1> = |p|” 

(29.60) T=1:} |1,0> = (1/72) (|p>|p>—|n>|n>) 
[1, -1> = |»>|D> 

(29.61) T =0: |0,0> = (1/v2)(|p>|B> +|n>|%. 


The two possible eigenstates of I’, namely |p>|p> and |n>|m>, are 
therefore necessarily mixtures of states belonging to 7’ = 0 and 7’ = 1, 


(29.62) 

|p>|B> = (1/2) (|0,0>+|1,0>); —_|n>|m> = (1/2) (|0, 0>—|1, 09). 
Despite this impossibility of using eigenvalues of I'y to characterize the 
conjugality properties of a state with given quantum numbers 7’, 7';, Lee 
and Yang pointed out that there is an operator involving I’ which can 
be used for this purpose, namely 
(29.63) 

@ = Texp (inf 2) = Pp x Tso x Tyexp (tna) = Tp x Msc x Gy 
consisting of a rotation by angle z around the 2-axis in isospace combined 
with the operation of particle conjugation. @ has the commutation 
properties 
(29.64) GIG = 0: 

(29.65) GB+BG@ = 0, 
can therefore be diagonalized simultaneously with 7,73, and a state with 
B= 0 can be an eigenstate of G, called a state of specific G conjugality. 


To obtain an explicit representation in the space (29.52), one utilizes 
the relations, valid for 7’ = 4, 


(29.66) Pete ie, and) Pe gy 
which follow from (29.53), yielding 
(29.67) 

Gy = Py [cos (n/2) 1 + 2i sin (7/2) Po] 


000 -1 pl 0 0 

0 1 Oo}; -1 0 0 0 
OQ Fle 0 0 0 0 Ont 
-1 00 0. 00 -1 0 
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only 40. OronaT 
~|-1 00 oF 
0 -1 0 0 
Single particle states transform thus under G as 
(29.68) 
Gulp) = —|R>;  Guln> = |B; Gul™ = |p;  Gu|D> = —|n). 
@y is by definition unitary and satisfies 
(29.69) @=-I for B= +1, 


Since in a system containing many particles and antiparticles T and B are 
additive, whereas G is multiplicative, one has generally 


(29.70) 
GT-TG=0; GB+BQ=0; G* = 0G =(-1)? 


provided C? = +1, where C denotes the eigenvalue of I"p x Isc, and is 
identical with the ordinary conjugality introduced in Section 28. 

Consequently, states with B = 0 can be assigned specific G conjugality. 
Components of the same isospin multiplet 7’ have always the same con- 
jugality Gy. For example, one reads immediately from (29.68) the Gy 
conjugality of the nucleon-antinucleon states (29.60) and (29.61), as 
summarized in Table 29.5. Since for the ordinary conjugality C the same 
analysis applies as in the case of positronium (see Table 28.1), the total 
G conjugality of a nucleon-antinucleon pair is given by 


(29.71) G = OGy = (-1)**(-1)?. 


From the invariance of strong interactions under particle conjugation 
and under rotations in isospace, the conservation of G in all processes 
mediated by strong interactions now follows. As the concept of G conjugality 
can be extended to pions and strange particles, a number of interesting 
selection rules are engendered by this conservation law. To obtain some 
of these consider next the conjugality properties of pions. 

Since the ° decays into two photons, and since the interaction respon- 
sible for this decay is assumed to leave the conjugality C invariant, it 
follows from (28.14) that the state |7°> must be a state of even conjugal- 
ity, 

(29.72) T|n°> = +|7°. 


There is no need to distinguish between I" and I’y here, because pions 
have spin 0 and one can represent I’ in coordinate space by the identity 
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TABLE 29.5 


Tue Gy ConsuGatity or NUCLEON-ANTINUCLEON Pairs 


T Ts State Gy 
1 +1 [p>iny -1 
0 (1/¥2)(Ip>[p>—|n>|%>) -1 

—1 — |n>Ip> =1 

0 0 (1/¥2)(|p>P>+|n>|7>) +1 


operation. The charged pions, on the other hand, may be considered as 
each other’s particle conjugate, so that I’ satisfies 


(29.73) T|n*> = |r) and Pia = |x. 


This implies, in the isospace spanned by the vectors (29.31), the repre- 
sentation 


OmOm =a 
(29.74) ies || iw 
-1 0 0, 


and the C.R.s of I’ with the operators of isospin (29.32) and (29.33) are, 
as in the case of the nucleon-antinucleon system, 


(29.75) 
PP lr =.0 
rt,-?,T =0 
Te BREEU TD es ie. TQ+QIT = 0 for pions, 


giving rise once again to the construction of an operator G defined as in 
(29.63), which will still satisfy (29.70), and which can be represented, 
because of the relations (valid for 7’ = 1) 
(29.76) 

Pet 7, for n> 0; P= 7 for n>1 
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in the isospace of the pions (29.31) as 


(29.77) 
G@ = Texp(in?,) = P{1 +isin (7). + [cos (m) — 1] 73} 


1 0 0 10 -l 
rf( n)-( 02 i} 
OO -1 0 1 

00 -il\ /0 0 1 -1 0 0) 
( One }(0 -1 o) = ( 0 -1 ). 
-1 0 0/ \L 0720 0 0 -1 


This means all pion states are eigenstates of G; the pion has odd G con- 
jugality. 

If now @ is conserved, a number of selection rules can be inferred 
immediately by consulting Eq. (29.71), for example: 


(I) The system |p>|n> which belongs necessarily to isospin 7’ = 1 
cannot decay through strong interactions into an (°¥fP) number of pions 
from a state with 1+ (°¥f) such as the 


(ie "Po, + ) 
*S1, ‘Pi, 
state. 

(II) An even number of pions cannot by strong interactions go into 
an odd number of pions, and vice versa. 

Since Eq. (29.49) is invariant under particle conjugation, strangeness 
anticommutes with the operator Iy, and extension of the foregoing 
consideration to strange particles requires incorporation of the additional 
CR. 


(29.78) GS+SE =0 
leading to 
(29.79) @? = (-1)8+8 


and the conclusion that only systems with both B = 0 and S = 0 can be 
states of specific @ conjugality. Further details can be found in the 
comprehensive work of Goldhaber, Lee, and Yang, quoted in the 
references at the end of this section. 

The treatment of the conjugality properties of particles possessing 
isospin, given above, has the defect of not taking into account the anti- 
unitary nature, already recognized in Section 19, of the particle- 
conjugation operator I’ applied to fermion states governed dynamically 
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by the Dirac equation. The consequences of this complication have never 
been fully analyzed, because the transformation properties of isospin T 
under particle conjugation are not known with certainty, and one can 
infer from the invariance of (29.49) only that 7’, must change sign under 
I’. Suppose, for example, all isospin components anticommute with the 
correct antiunitary operator of particle conjugation, denoted I",, 


(29.80) r,T+iT, = 0. 


By analogy with the corresponding property (15.14) of the antiunitary 
operator of time reversal 9@—with respect to ordinary spin J—I’, can be 
represented in isospace by the operator 


(29.81) (P'4)rs = [exp (in? 2)] K. 


Whether invariance under I", engenders a superselection rule or not 
depends on whether [} = —I or '{ = +J. Now the operator I", for 
baryons is the direct product 


(29.82) Ty = l'pxTso x tg 


of the representations in coordinate space, spin-chirality space, and 
isospace. Since in the combined coordinate-spin-chirality space, 
according to Eq. (19.69), 


(29.83) (Tp x Isc)? = +12 


and since it follows from (29,81), as in the analogous case governing the 
representation of time reversal, that 


-I if P=}4,... 


29. Y Ne 2 = 
(29.84) [(L4)is] la # 1 =0,1,... 


one concludes 


) 2 | —I for nucleons and & hyperons 
(29.85 js 


+I for A and & hyperons. 


This means invariance under particle conjugation engenders a super- 
selection rule only for nucleons and & hyperons. Moreover, since in- 
variance under J’ does not hold in weak interactions, the attribute 
associated with this superselection rule will be conserved only in strong 
interactions. An attribute which fits this description is the hypercharge 
Y defined in (29.50), because Y is unequal zero only for nucleons and 
& hyperons and is conserved in strong interactions only. It is thus not 
unreasonable to conjecture that conservation of hypercharge is, in fact, 
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due to a superselection rule generated by invariance under particle 
conjugation. 

The invariance under combined inversion 2, = JJI"4, on the other 
hand, since by Eq. (19.71) for fermions 


(29.86) (Sas Dee) ead 


will yield a superselection rule only for A and 2 hyperons, because 
D4 = Up x Ugo x Tyg satisfies (assuming JJ? = +J in isospace) 


(29.87) 


(2)? | +2 for Te $d: 
"= 


-I for 7 =0,1,... ie.fordand Zhyperons. 


One is thus led to conclude that A and & hyperons possess an attribute 
not shared by the other baryons, which is conserved through a super- 
selection rule in all interactions invariant under combined inversion, 
thus including, presumably, the weak interactions. The nature of this 
attribute, if it exists, is obscure.* 

In any case, the conservation of baryon number (an attribute shared by 
all baryons) valid in all interactions, cannot be understood as a super- 
selection rule engendered by particle-conjugation symmetry. 

This leaves as the only symmetry which may be made responsible for 
the conservation of baryon number through a superselection rule the 
invariance under time reversal @ as had been suggested at the end of 
Section 15. 

Fortunately, the conclusions reached earlier regarding selection rules 
following from conservation of @ conjugality in strongly interacting 
systems with B = 0 and § = 0 are not invalidated, if the two-dimensional 
baryon-number space is identified with the two-dimensional reversality 
space spanned by the baryon states |B) and their time reversed analog 
6|B>. In particular, the representation (29.57) for the operator of 
particle conjugation can be looked upon as the direct product 


wn ro B( 9, 


made up out of the representation in isospace and in reversality space 
in analogy to the representation (15.44). 


* There is one more symmetry operation (valid only for strong interactions), the “ weak 
reflection,” 170, which is antiunitary and whose square is yay1ysK yay1ysK = —1 for 
all fermions. The nature of the corresponding attribute, which ought to be conserved in 
strong interactions by a superselection rule, is equally obscure. 
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The conclusions reached in this work regarding possible superselection 
rules engendered by antiunitary symmetry operations and their possible 
interpretation are summarized in Table 29.6. 


TABLE 29.6 


List or PossinLE ATTRIBUTES CONSERVED THROUGH A SUPERSELECTION RULE 


Symmetry operation gene- Conserved 
rating superselection rule Applicable to: attribute Validity 
Time reversal @ Leptons Muon number L,, —_Alll interactions 
Baryons Baryon number B All interactions 
Combined inversion Y= III Leptons Lepton number LZ All interactions 
Aand 2 ? All interactions 
hyperons 
Particle conjugation I” Nucleons and Hypercharge Strong interactions 
& hyperons Y=B+S 
Weak reflection 70 Fermions ? Strong interactions 
NOTES 


Brueckner [1] turned the tide in favor of isospin as a legitimate 
attribute. 

Fermi’s article [2] contains a complete treatment of the isospin 
formalism and its application to problems involving nucleons and pions, 
as well as references to earlier work on this subject. An analysis of the 
energy levels of hypernuclei in terms of isospin labels was given by 
Morrison [3]. 

For details on branching ratios and selection rules following from 
isospin conservation see the lecture notes by Sakurai [4]. The concept of 
strangeness as a quantum number is due to Gell-Mann [5]. See also 
Nishijima [6]. The significance of hypercharge as a quantum number 
was pointed out by d’Espagnat and Prentki [7]. The concept of G 
conjugality, although already used by Michel [8], was introduced 
generally by Lee and Yang [9]. A comprehensive treatment of selection 
rules following from particle conjugation symmetry for systems of 
vanishing baryon number B is contained in the paper by Goldhaber 
et al. [10]. 

Robertson [11] has analyzed some consequences of the assumption 
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that the anti-unitary operator of particle conjugation anticommutes 
with the isospin operator, and that isospin is invariant under reversal 
of motion. 
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*SECTION 30 


The Quasi Particle Concept 


Some of the most startling manifestations of the quantum mechanical 
principles which apparently govern the actual physical world are 
macroscopic phenomena encountered in liquids and solids at low tempera- 
tures, known as “superfluidity” and ‘“‘superconductivity.”” They can be 
understood by application of the principle of superposition of probability 
amplitudes in conjunction with the permutation symmetry characteristic 
of many-particle states. Although a theory of supertransfer phenomena 
could have been developed immediately after the inception of quantum 
mechanics, and although Hinstein provided, through discovery of the 
“Einstein condensation” of bosons, an important clue to the under- 
standing of superfluidity even before the meaning of the superposition 
principle had been fully appreciated, it took more than twenty years 
before the now generally accepted ideas needed to explain these pheno- 
mena assumed shape. This is rather surprising, after the event, and may 
perhaps partly stem from a tendency of physicists, raised on a diet of 
classical notions regarding the particle concept, to relegate applicability 
of quantum mechanics to a strictly microscopic domain. 

A substantial intrusion of quantum mechanical concepts into the 
macroscopic domain should be expected, for objects in a state of low 
temperature containing many particles, as a result of quite elementary 
considerations. Take, as an example, a liquid made up out of n bosons 
enclosed in a volume V. To account for the low compressibility of fluids, 
in general, assume for simplicity’s sake as the only interaction between 
any two bosons a “‘hard core” potential as indicated in Fig. 30.1. In the 
liquid state each boson will be hemmed in by its neighbors so that it 
occupies a volume of order (V/n) ~ rj. Consequently, each boson will 
have in the state of lowest energy a rather high “zero-point energy” of 
order 


(30.1) €o ~ (kg/2m) ~ (1/2mrp) = (1/2m) (n/V)°9 


and the spacing between the ground state and the first excited state of a 

single boson will be of the same order of magnitude. Therefore, if the 

temperature 7’ of the liquid is less than a finite critical temperature, 
289 
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identical with Einstein’s “condensation temperature” (natural units 
with Boltzmann’s constant & = 1 are used) 


(30.2) T < Ty) ~ (1/2m) (n/V)?8 


practically no single-particle excited states will exist in the liquid. By 
putting the known density and atomic mass for liquid helium into 
(30.2) one finds for 7’) a value of about 3°K. 

On first sight one might conclude from this that there is no mechanism 
by which the fluid can take up energy below 7'o, resulting in a practically 
vanishing specific heat at temperatures below 7’). Upon second thoughts 


fo 


Fic. 30.1. The “hard core”’ potential. 


it will be realized, however, that there is a mechanism for imparting 
energy to the fluid by excitation of collective modes, such as sound waves, 
which may be described quantum mechanically by admitting single 
particle states which are linear superpositions of ground and excited states, 
the amplitude of the ground state being dominant and the excited states 
represented by small admixtures only. The spacing of energy levels 
characteristic of these collective modes will then be determined by the 
size of the entire volume V, and one anticipates an almost continuous 
spectrum of “phonons” in analogy with the practically continuous 
spectrum of photons representing black body radiation enclosed in a 
volume V. By this analogy the specific heat of the liquid should be 
expected to depend on temperature 7’ and the velocity of sound ¢o 
according to 


(30.3) (dB|dT) ~ (T°/c8). 


This result is borne out by observation in liquid helium below 0.6°K, 
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with an accuracy astonishing in view of the simplifying assumption 
made above about the intermolecular forces. 

The critical temperature 7'y (30.2) has another significance which leads 
one to expect drastic consequences of the restrictions imposed by 
permutation symmetry on state vectors describing n indistinguishable 
particles at temperatures 7'<7'y. The measurement of the number of 
particles n, in the classical sense of counting spatially separate objects, and 
the measurement of the temperature T, are incompatible at temperatures 
T <T4, for the following reason. If one wishes to “count,” in the classical 
sense of spatial separation from its neighbors, a particle in a volume V 
occupied by n particles, one must confine it to a region of volume less 
than (V/n). This corresponds to a localization of coordinate 


(30.4) Aq < (Vin), 
and an uncertainty in the knowledge of the particle momentum 
(30.5) Ap & (1/49) 2 (n/V)"8 


is engendered by the uncertainty relations (10.5). Thus, there will be an 
uncertainty in the knowledge of the energy of each particle 


(30.6) Ae ~ [(Ap)?/2m] & (1/2m) (n/ VP 
corresponding to an uncertainty in the knowledge of the temperature 
(30.7) AT ~ de 2 To. 


One may look upon this intrusion of specifically quantum mechanical 
features into the behavior of physical objects at temperatures 7’ < T'g 
from yet another point of view which might be instructive. Suppose one 
is in possession of that supremely intelligent agent known as “ Maxwell’s 
demon,”’ and instructs it to separate, by judicious opening and closing 
of a door of diameter d between two compartments of V, the fast and the 
slow particles in a gas consisting of n particles (see Fig. 30.2). The 


eo 


Fra. 30.2. The door operated by Maxwell’s demon. 
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particles are assumed to have the usual velocity distribution of average 
velocity 3 and width ~ (7'/m)"*. If the little fellow is instructed to 
separate particles with v >@ from particles with v < %, then his un- 
certainty in the knowledge of the velocity of each particle must be less 
than the natural width of the velocity distribution, 

(30.8) (T/m)2 = Av. 


The uncertainty relations impose a lower limit on 4v, however. During 
time 7 the door of area d? will be bombarded by approximately (n/V)d? dr 
particles. If the demon wants to let a particle pass, he must not let the 
door be open longer than the average time 47 between successive arrivals 
of particles, 


(30.9) Ar ~ (V|nd?3) 


engendering thus an uncertainty in the knowledge of the kinetic energy 
of the particle of order 


(30.10) Ae & (1/Ar) ~ (nd23/V) 

corresponding to an uncertainty in the knowledge of the particle velocity 
(30.11) Av ~ (Ae/mv) 2 (nd?/mV). 

The door itself produces an uncertainty because of the localization of the 
particle to diameter d during passage, 

(30.12) (Av), 2 (1/md). 


In order that (4v), will not be larger than (30.11), the opening d should 
have at least the size 


(30.13) d > (1/mAv). 
By substitution into (30.11) one obtains the inequality 
(30.14) Av 2 (1/m) (n/V)¥8 
and therefore from (30.8) 
(30,15) T 2 (i/m)(n/V)P ~ To. 
This means: Maxwell’s demon can begin to do his job only if the tem- 
perature of the gas is at least equal to the characteristic temperature 
Ty. Below 7'y his intelligence will be completely frustrated by the uncer- 
tainty relations. 

The foregoing considerations will not be qualitatively affected if one 


replaces the crude hard core potential by a more realistic interaction 
including an attractive potential well, as indicated in Fig. 30.3. In one 
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Fic. 30.3. Realistic interaction potential between two bosons. 


respect, however, an attraction in addition to the repulsive core between 
two bosons should lead to an important new feature. The state represent- 
ing a physical situation in which all single particles are in their ground 
state need no longer be the state of lowest energy for the entire system. 
Presence of a few phonons can minimize the average potential energy 
between particles such that the increase in average kinetic energy 


n(k) 


Fig, 30.4, Distribution of bosons in momentum space at absolute zero of temperature 
in the presence of an interaction of the type of that shown in Fig. 30.3. 


occasioned by these collective modes is more than compensated. Thus 
even at the absolute zero of temperature the ‘‘ Einstein condensation”’ 
into the ground state, represented by a 6 function in momentum space 
in case of the ideal gas, will be modified by the actual interaction into a 
distribution spread out somewhat as indicated in Fig. 30,4, The expec- 
tation that in the absence of attractive interaction the ground state of 
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theentiresystem would be the state in which all particles are “condensed” 
into the state k = 0 (apart from a zero-point energy which will from now 
on be taken as the origin of the energy scale), whereas “switching on” 
the weak attractive interaction results in a ground state for the entire 
system in which the single particle states acquire admixtures of excited 
states with k 4 0, is borne out by the following more rigorous argument 
due to Bogoliubov. 

A system of weakly interacting bosons, subject to the usual conser- 
vation laws and the condition that the total number of bosons is a given 
constant and that all interactions can be accounted for entirely by specific 
two-body central forces, can be described by the Hamiltonian 


(80.16) H = 5 o(k)b*(k) b(k) 
+O DED PKK’) Ser tee et B*(k”) b+ (k") b(k’) b(k). 


wie 
The bosons are assumed to have no intrinsic properties and to be charac- 
terized completely by the momentum quantum numbers k. 


It is perhaps worth elaborating a bit, parenthetically, on how the various conditions 
imposed phenomenologically on the actual system enforce the specific form (30.16), 
Considering first a set of bosons without interactions, one could begin in a more systematic 
vein by considering the most general bilinear expression 


(30.17) Ho = © D w(k’,k)b+(k’) b(k) 
kK k 


which will guarantee conservation of particle number because it contains for each anni- 
hilation operator exactly one creation operator, and has no matrix elements between 
states of a different total number of particles. In the language of Feynman graphs, each 
term of Ho can be represented as indicated in Fig. 30.5. Imposing now one by one the 


AK 


a 


Fia. 30.5. Graph for (30.17). 


various invariance requirements on Ho, one arrives at successive restrictions imposed on 
the function w(k’,k), namely 


(i) Invariance under reversal of motion requires w(k’,k) = w*(—k’, —k) 
(ii) Invariance under inversion of coordinates requires w(k’,k) = w(—k’, —k) 
(iii) Invariance under displacement in time requires w does not contain ¢ explicitly 
(iv) Invariance under displacement in space requires w(k’,k) = w(k’, k) dy, 
(v) Invariance under rotations in space requires w(k’,k) = w[(k’:k)] 
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and one obtains the usual form for the Hamiltonian of a system of noninteracting bosons 
(30.18) Ho = X & of(k’+k)] dy Ot (k’) b(k) = XZ w(k) b+(kk) b(k); w real. 
Kk k 


The analogous expression for noninteracting fermions is similarly obtained. 

Tf all Hamiltonians which are not of the form (30.18) are called “interaction Hamil- 
tonians,” then all possible interaction Hamiltonians can be divided into 

(A) particle conserving Hamiltonians, containing in each term as many creation 
operators as there are annihilation operators, and 

(B) particle nonconserving Hamiltonians which do not satisfy the condition stated 
under (A). 

A simple interaction Hamiltonian of type (B), conserving the number of fermions but 
not the number of bosons and reminiscent of the expression (23.6) employed in quantum. 
electrodynamics, is 
(30.19) Ai, = XD DY G(k’,k’, k)[at(k”) a(k’) b+(k) + hermitean conjugate] 

kK’ k’ k 


K " 
K: K, 


Fig. 30.6 (left and Fra. 30.7 right). Graphs representing (30.19). 


corresponding to the graph given in Fig. 30.6 and the reversed graph given in Fig. 30.7. 
Invariance requirements again impose restrictions on G, demanding that it be of the form 
(30,20) G(k",k/,kk) = gfe’ —k)25 2] Beste 

One of the basic ideas employed in the theory of interactions is to replace all direct 
particle interactions by intermediate boson interactions, so that all graphs are viewed as 
made up out of graphs containing only three-particle vertices such as the ones rendered 
graphically in Figs. 30.6 and 30.7. The perplexities caused by the infinite hierarchy of basic 
interaction vertices have been exhibited in Section 25. For the purpose of the present 
section, phenomena will be accounted for by introduction of a boson conserving interaction 
Hamiltonian 
(30.21) Hy, = 2 DLE Wk’, kk’, k) bt(k”) b*(ke”) b(k’) b(e) 

Keke’ kk 

corresponding to the graph given in Fig. 30.8, No attempt will be made to justify this 
expression by deriving it through perturbation theory or some other formal device from 


Daal ae 


An ig 


Fia. 30.8. Graph representing (30.21). 
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some other supposedly more basic interaction such as (30,19). In fact, the conditions 
under which an expression such as (30.21) will follow from a basic interaction such as 
(30.19) are, in general, unknown, and it is even an open question whether such conditions 
exist at all. 

A possible dissection of the graph in Fig. 30.8 in terms of the graphs in Figs. 30.6 and 
30.7 is given in Fig. 30.9. It is transparent that such a decomposition of, say, the empirical 


Fia. 30.9. Possible dissection of the graph Fig. 30.8 in terms of the graphs rendered in 
Fig. 30.6 and 30.7. 


interaction between two helium atoms is quite unrealistic, because if this interaction is 
essentially electromagnetic in origin, one has to solve, in principle, a six-body problem on 
account of the two nuclei and the four electrons involved in this interatomic interaction, 

Casting aside then all doubts regarding a possible deeper origin of the interaction (30.21), 
one concludes from the invariance requirements imposed by symmetry under translations 
and rotations that W be of the form 


(30.22) Wk’, k, Kk’, kk) = FEI’); (leo) 3 (Ie! e’”)] Spe ate nets 


Instead of the three scalars (k-k’), (k-k’’), (k’-k’’), one can use equivalently as scalar 
parameters characterizing the function F the barycentric energy (k+k’)? and the two 
momentum transfers (k—k’’)? and (k’—k’’)?. If one demands further that / represent a 


central force between any two bosons, expression (30.16) for the total Hamiltonian 
results. 


Suppose now, in accordance with the qualitative considerations stated 
earlier, there is an average number 7% of particles in their ground states 
k=0, so that the actual number of particles belonging to k =0 is 
Ny = Ny+n', where n’ may have positive or negative values so that 
n’ = 0 but in any case |n'| < 7g. The number of excited particles with 
k # 0 is then n, = n—7y)—n’. One expects the lowest states |r(n)> of the 
system of n interacting bosons to be a linear superposition of states 
|mo, 01> 


(30.28) |x(m)> = ¥ er(ms)]M0,01> = ¥ eg(n—iy—n')]fig +n’, 01> 


where c,(n,) is a properly normalized amplitude involving operators 
*(k) which create 7; particles from the state |...,0,> and where 


(30.24) b*(0) B(0)|20, 01> = mo|n0; 01>. 
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The sum ¥,, in (30.23) goes over positive and negative values of n’. The 
label + corresponds to some order of the lowest states, r= 0 denoting 
the ground state (say) of the entire system, 7 = 1 the first excited state, 
etc. One has further, from the representations for the boson creation 
and annihilation operators (18.4), 


(30.25) 5(0)|r()> = Y Vnoe-(mr)|0— 1, 01> 
(30.26) b*(0)|7(m)> = Y Ving + 1e,(1)|mo+1, 01> 


(30.27) b*(0)B(0)|7(m)> = x Mo C7(M1) |"; 01> 


= Y (Mo +n’) c,(r1)|%o+n’, 01>. 


n 


Thus far no approximations have been made. Since one will have 
|n'| < 7% for all appreciable amplitudes c,(n,) = ¢,(m—7%)—n’) one may 
write instead of (30,27) the approximation 


(30.27’) b*(0)B(0)|r(n)> = %o|7(m)>. 


By the same reasoning, and assuming ¢,(n,) does not depend sensitively 
on the precise value of the total number of particles n, one may write 
further the approximations 


(30.25’) 
b(0)|7(n)> = Vig Y c,(n— No —2') |g +n’ — 1, 04) 
= VN X el(n— 1) —(7ig +n’ — 1)] |g +2’ — 1, 01> 
= Vito|7(n—1)> 
and 
(30.26’) b+(0)|z(n)> = Vio|7(n +1)>. 


Since one wants to apply the Hamiltonian (30.16) to such lowest states 
(or superpositions of them if n is not fixed), it is useful to separate in 
Hi, the terms with k = 0 from those with k 4 0 so that the approxi- 
mations (30.25’), (30.26’), and (30.27') can be applied, One finds, first of 
all, the exact expression (5, means sum over all k ¥ 0) 
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(30.28) 
Ain, = b*(0)b*(0) 6(0) 6(0) F(0) 


+2b+(0) 10)| R00) D! bt (k) b(kk) +! P(e) b+ (k) v0e)| 
k k 
+ 0(0)B(0) BF (k)*(k) 6% —k) +-B°(0)b*(0) 3 () 6(R) B(—R) 


+0(0) x 2 [LF (k) +F(k’ —k)] 6*(k) b+ (k’ — k) b(k’) 
Kk’ #k 

+b*(0) x’ 2! [F(k) +F(k’)] 6*(k’ +k) b(k’) b(c) 

 #k 


k 
k’ 
1 Fe" — Ie) Byte wc BPC”) B*(k") B(e’) (Ie). 


+ 
Sah 
M 


Since this interaction Hamiltonian will be applied to low-lying states 
|7(n)> only, the entire Hamiltonian will now be approximated in accord- 
ance with Eqs. (30.25), (30.26’), and (30.27’) by putting 


B*(0) (0) = b*(0)B*(0) = B(0) (0) = Hy 
and neglecting terms of lower order in 5(0) and 6*(0), terms linear in 
6(0) and 6*(0) being of order V7, and terms containing no operators 


(0) and b*(0) being of order 1 compared to the terms with Mp in them. 
One obtains, thus, 


(30.29) 
A = figw(0)+ ¥! e(k) D+) B(k) +75 F(0) + 27%) F(0) &’ b¥(k) b(k) 


+27 y F(k) b* (ik) b(k) + 7p a F(k) [b*(k) b*(—k) + b(k) b(—k)] 


This Hamiltonian will be applicable even if the total number n of particles 
is not fixed, so that any low-lying state |7> can be written as a linear 
superposition of states with various n, |r) = Xnd(n)|7(n)>. However, if 
only state vectors describing a definite number n of particles are con- 
sidered, one may replace 7% by using 


(30,30) m = b*(0)b(0) +S’ b*(k) b(k) = Mot d b*(k) b(k) 
k 
and obtain, in the same approximation, 
(30.31) H = nw(0)+n2F(0)+ x [w(k) — (0) + 2nF(k)] b* (Ik) b (Ic) 
+n > F'(k) [b+ (k) b*( —k) + b(k) b( —k)]. 
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This Hamiltonian does not conserve the number of excited particles, 
which is not surprising because the entire approximation is just aimed at 
considering the single particle state with k = 0 as an effectively unlimited 
reservoir from which particles may be lifted by the interaction into excited 
states with k # 0. 

To diagonalize this Hamiltonian, Bogoliubov introduced new operators 


B(k) = u(k) b(k) — v(k) b*(—k) 
paw) 5*(k) = w(x) b*(k) — v(k) b( —k) 
where w(k) and v(k) are real numbers satisfying the relations 
(30.33) u(—k) = u(k); v(—k) = o(k); w(k)—v*(k) = 1. 


The minus sign in front of v in (30.32) is purely conventional, » need not 
be positive. With these conditions the transformation (30.32) is canonical 
ice. 5, 6* satisfy the C.R.s as do the b, 6*. Indeed, 


(30.34) 
(B(x) B*(k’)) 


I 


u(ik) w(k’) [b(Ic) b*(k’)] + v(k) o(k') [5*( —k) o(—k’)] 
—u(k) v(k’) [(is) b( —k’)] — u(e’) v() [5*( —k) b*(k')] 


w(k)—v*(k) for k = k’ 
0 for k # k’ 


[5(k) 5(k’)] = —w(kk) v(k’) [(Ic) 6*( —k’)] — w(k’) v(x) [6*(— ky) 5(K)] 
{—w(k) o(k’) + u(') 0(k)} 84, 


= —u(k) v(—k) + u( —k) v(k) for k = —k’ 
Ex i‘ for k # —k’ 


iH 


fu(he) ull) — o(he) 00k )}8x 4 = { 


Ml 


(5+ (he) B*(k’)] = {—w(hs’) o(lk) + w(k) 0(8')} 8x, ae 


_ f —u(—k) o(k) + w(k) o( —k) for = —-k’ 
= i for k # k’ 


which gives the usual C.R.s for boson operators if (30.33) is satisfied. 

The operators 6(k) and 6*(k) will now be referred to as the annihilation 
and creation operators of quasi particles of momentum k. The Hamil- 
tonian H can be expressed entirely in terms of quasi-particle operators, 
because (30.32) can be solved for the b,b* to give 


b(k) = w(k) B(k) + v(x) 5+( —k) 


99.85) b+(k) = u(k) 6*(k) + v(k) b(—k) 
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so that 
(30.36) b*(k) b(k) = v(x) + w(K) b+ (k) B(k) + v2(k) 6+(—kk) 6(—k) 
+ u(Ik) o(k) [* (Ik) 6+( — kk) + B(ik) 5( —k)] 
b*(k) b*(—k) = w(K) v(t) + w(kk) v(x) [6+ (k) 5 (ke) + 6+( —k) B( —k)] 
+u*(K) b+ (kk) 5+( —k) + v2(k) 6(Ic) (—k) 
b(k) b(—k) = u(k) o(kk) + w(t) v(x) [* (Ic) B(k) + 8+( —k) 6( —k)] 
+u*(kk) b(k) b( —k) + v2(k) b+(k) 5+( —k). 


One obtains then, using the symmetry properties w(—k) = w(k) and 
F(—k) = F(k) 


(30.37) A = A,+4,+fh, 
with 
(30.37a) Ay = nneo(0) +m F(0) + 5" {[w(Ik) — w(0) 
+ 2nF (kk)] v2(k) + 2nF'(k) w(x) o(k)} 
(30.37b) A, = > {[w(Ix) — w(0) + 2nF (k)] [w2(k) + 02(k)] 


+ 4nF (k) u(Ix) v(kk)} b+ (k) (k) 
(30.37¢) A, = 2 {[w(k) — w(0) + 2nF (k)] w(Ic) v(c) + 2. F (k) [u2(k) 


+0°(k)]} [6*(k) 6*( —k) + 6(k) 6(—k)). 


This Hamiltonian is obviously diagonal provided u(k) and v(k) are 
arranged such that A, = 0. If this is done, then 


(30.38) H= Ao+d’ o(k) 6*(k) 5(k) 


where now 

(30.39) @(k) = [c(kk) — w(0) + 2nF'(k)] [w2(Ie) + v2(k)] + 40 F (Ik) w(Ie) v(Ic) 

can be looked upon as the energy of a free quasi particle of momentum k. 
The condition for the vanishing of H, is 

(30.40) — [w(kk) — w(0) + 2nF'(k)] w(ik) o(k) +n 0" (k) [w?(Ic) + v?(Ik)] = 0 


which has to be solved in conjunction with (30.33). The solution can be 
obtained as follows. To satisfy (30.33) identically let 


(30.41) u(k) = cosh (a); v(k) = sinh (a) 
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so that [usingsinh (w) cosh («) = }sinh (2x); sinh® (2) + cosh? («) = cosh(2zx)] 
(30.42) (kk) o(k) = }sinh (22); u*(k) +v?(k) = cosh (22). 
Equation (30.40) then reads 


2nF'(k) 


(30.43) tanh (2x) = ~ jot) cal) + APE 


from which follows 

fusing sinh («) = tanh (x) [1 — tanh? (a)|-; cosh (7) = [1 — tanh? (a)]-!/?} 
nF (k) 

© -V [eo(k) — (0) + 2nF(k) |? — 4n® F2(Is) 

w(k) —w(0) + 2nF(k) 

V [w(k) — o(0) + 2nF'(k) |? — 4n? F2(k) 

By substitution into (30.39), this leads immediately to 

(30.46) &(k) = V[w(k) —w(0) + 2nF'(k)]? — 4n? F2(k) 
= V[o(k)— (0)? + 4nF(k) [o(k) — @(0)] 

Depending on the details of the interaction function F(k), the quasi- 
particle energy @ can thus depend, in principle, on the momentum in 
the manner anticipated in Fig. 11.1, leading to the appearance of super- 
fluid behavior as had been explained in Section 11. 

The state of lowest energy of the system can now be described, in 


accordance with (30.38), as the “quasi-particle vacuum state” |7 = 0), 
having the energy 


(30.47) Ay = nw(0)+n2F(0) +4 x [@(k) — w(k) + w(0) — 2nF'(k)] 


(30.44) u(k) v(k) = 


(30.45) —-u2(k) + v%(k) = 


which is obtained by substitution of 
w(k) — w(0) + 2nF(k) — &(k) 
20(k) 
_ 2n?F(k) 
~ @(K) [w(k) — w(0) + 2nF'(k) + &(k)] 

This result confirms the expectation that the quasi-particle vacuum 
is not identical with the state in which all ordinary particles occupy their 
ground state. In fact, v°(k) is identical with the average number of 


ordinary particles in the momentum state k, because, from the first 
Eq. (30.36), it follows that 


(30.49) <r = 0|b+(c) b(k) |r = 0> = nk) = v°(k) 


(30.48) v(k) = }[cosh (2x)—1] = 
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which will, in general, be of a form anticipated in Fig. 30.4. 

An analogous treatment is possible for a system consisting of NV 
fermions, with similar results, the quasi-particle vacuum being in this 
case a state which, depending on the kind of interaction present between 
individual particles, may involve a spreading out of particle-occupation 
states near the Fermi surface. Consider, first of all, a system of non- 
interacting fermions, described by the Hamiltonian 


(30.50) Ho = % Qk) a*(k) a(k), 


which conserves the particle number N = «|My = (|X, @*(k)a(k)|>. In 
the state of lowest energy, all levels up to a certain kp (the “Fermi 
surface’’) will be filled, and all levels above it will be empty. If one is 
interested only in small deviations from this ground state, one is led to 
consider instead a description of the system in terms of the Hamiltonian 


(30.51) Ay = Hy—QpN = ¥ [Q(k)—Qp]a*(k)a(k) 
K 
where 927 = (ky) is the Fermi energy associated with ky, to be deter- 
mined from the requirement 
(30.52) <|N|> = N. 
To find the eigenvalues of H, introduce new operators 


a(k) for Q(k) > Qp 
at(—k) for Qk) < Q, 
(k may stand here for both momentum and spin labels of fermions), which 


satisfy, as had been shown in Section 17, the same anti-C.R.s as the a(k) 
themselves. One has then 


(30.53) a(k) = { 


(30.54) 
EM ST eee oe eins re es -k) for Qk) <Q, 
so that one may write, using 2(—k) = Q(k), 
(30.55) Ay = & [Q(k)—Qp][1—-a*(—k) a(-k)] 
Ak) < Oy 
+ z [Q(k) — 2p] (Ik) Ake) 


MAk)> Qe 


=ett Qk) &* (kk) G(k) 
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with 
(30.56) Qlkk) = |Q(k)— 2p]; &e = F (Qk) Dy}. 
One has, thus, a description in terms of quasi particles whose annihilation 
and creation operators are d(k) and @*(k). For Q(k) > Qp a quasi particle 
of momentum k corresponds to an ordinary particle of momentum k, 
but for 2(k) < Qy the quasi particle of momentum k corresponds to an 


ordinary hole of momentum —k. The vacuum state |0> of ordinary 
particles is defined by 


(30.57) a(k)|0> = 0 
whereas the quasi-particle vacuum state |0> is defined by 
(30.58) ak) |0> = 0 


which in the absence of interactions is physically identical with the 
ground state of the system of NV ordinary fermions. Since 


(30.59) A.|0> = ex|0> 
and 
(30.60) Hy a&*(k)|0> = [ep +Q(k)]@*(k) |0> 


one is justified in calling Q(k) the energy of a quasi particle of momentum 


k, In the quasi-particle vacuum |0> the number of ordinary particles, 
represented by 


(30.61) 
N= 3 a*(k) a(ke) = py (1 —@* (kk) @(k)]+ x &* (k) d(k) 
Ak)< Qe Ak) > Qe 
has, of course, the expectation value 
(30.62) <O|N|O> = zy 1=¥, 
Ak)<Qp 


whereas the number of quasi particles, 
(30.63) N= © a*(k) a(k) 
has the expectation value 


(30.64) O|N|0> = 0. 
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As in the case of noninteracting fermions, the Hamiltonian describing 
a system of weakly interacting fermions is most conveniently written so 
that its eigenvalues give the energy in terms of the deviation from the 
energy of the state in which all particle levels up to energy 2, are filled, 
(30.65) 

H = YY [Q(k)—Qp]a*(k, 8) a(k, 8) 
ke 
FEUD VVUV DCR KP K RW Sete wd (k”, 8) 
rok ke 8 
x at(k”, 8’) a(k’, 8’) a(k, 8). 
The spin labels have been written out explicitly, but the interaction is 
taken to be spin independent. 

It was first noticed by Cooper that the interaction, if attractive, can 
bring about correlations between pairs of fermions giving rise to a ground 
state in which the Fermi surface is smeared out, and that this result 
cannot in principle be obtained by perturbation theory. 

Cooper’s qualitative considerations can be made more precise by 
introducing, following Bogoliubov, quasi-particle operators 
(30.66) Gk, 1) = u(k) a(k, 1)—v(k) at(—k, 2) = &(k) 

. a(k,2) = u(k) a(—k, 2)+v(k)a*(k, 1) = A(k). 


This transformation is canonical (see Appendix 5), i.e. the @ and a@* 
satisfy the usual fermion anti-C.R.s 
(30.67) {@(k, s)@*(k’, s’)} = ben 8y,03 allother { } = 0, 
provided w(k) and v(k) are real numbers subject to the conditions 
(30.68) w(k) = u(—k); o(k) = o(—k); uw?(k)+v2(k) = 1. 
Tf, in particular, 
ae ayia for Q(k) > Qp 

Oe) Cia te COS ie for Q(k) < Qy 
one obtains essentially the transformation (30.53) for the case of non- 
interacting fermions, namely 
(30.70) 

a(k, 1) = a(k, 1); G(k, 2) = a(—k, 2) for Q(k) > Qp 

G(k,1) = —a*(—k, 2); G(k, 2) = at(k, 1) for Q(k) < Qy. 
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One may thus, in this case, look upon 


particle (k, 1) outside 
hole (—k, 2) inside 
the Fermisphere 


oA ee (particle (—k, 2) outside 
@(k, 2) asthe annihilation operator of a ( Hele (ety raide 


the Fermisphere. 


@(k, 1) as the annihilation operator of a ( 


In the general case (30.66), application of an operator a engenders 
superposition of a particle and a hole. 
Equations (30.66) can be solved for a(k, 1) and a(—k, 2) to give 


(30.71) 
a(k, 1) = w(k) @(k, 1) +0(k) @*(k, 2) = u(k) & (I) + v(k) B+(k) 


a(—k, 2) = u(k) @(k, 2)—v(k) @*(k, 1) = w(k) B(k) — o(k) &*(k). 


One can now substitute these expressions into the Hamiltonian (30.65), 
making use of the symmetry properties of the matrix element 


(30.72) (k"k"|F|k’k) = (k"k"|F|kk’) = (—k’—k|F|-k”—k", 


where the last equality follows from time reversal invariance, and 
diagonalize the resulting transformed Hamiltonian by suitable choice 
of u(k) and v(k) in a manner analogous to the one employed in the 
corresponding problem for interacting bosons. The calculations are 
rather lengthy, and the reader is referred to the work of Beliaev, quoted 
at the end of this section, for details. 

The principal result is a confirmation of Cooper’s more qualitative 
theory mentioned earlier, and a description of the system in terms of a 
quasi-particle excitation spectrum which, as in the case of interacting 
bosons, satisfies the criterion for the existence of supertransfer pheno- 
mena established in Section 11. 

It is instructive to express the ground state and the lowest excited 
states of the entire system in terms of the vacuum state and ordinary 
particle creation operators. The true vacuum state is defined by 


(30.73) a(k,s)|0> = 0 ie. a(k, 1) |0> = a(Ik)|0> = 0; 
a(—k, 2)|0> = B(k)|0> = 0 
while the ground state (the ‘‘ quasi-particle vacuum’”’) |0> is defined by 


(30.74) (kx, 1)|0> = a(k)|0) = 0; (Ik, 2) [0> = B(x) |0> = 0. 


306 Concepts in Quantum Mechanics 

Equation (30.74) must follow from (30.73) upon application of the 
unitary operator [see Eq. (A5.32)] 

(30.75) Ug = 1—o [1 —u(k)]—toe v(k) 

where 

(30.76) o9 = l—ata—Bt B—2a* «Bt B; a = i(at BY —Ba) 

so that 

(30.77) &=Up,aU}; fB=U,BU};  [0> = slo». 


Now application of o) and c, to the vacuum state gives 


(30.78) o/0> = |0>; 90> = iat Bt 0 

so that 

(30.79) |O0> = Up|0> = [w(k) +0(k) at B+] |0> 

or, generally 

(30.80) |0> = [| (wl) + 0(k) a* (kk, 1) a+(—k, 2)] |0>. 
k 


This expression was, in fact, the starting point of the famous BCS theory 
of superconductivity, in which the ground state is envisaged as con- 
taining correlated pairs of electrons in accordance with Cooper's idea. 

State vectors describing the presence of various numbers of quasi 
particles are constructed by using the same procedure as above: Apply 
U,, to the corresponding ordinary particle occupation state. For example, 
for specific k, one has 
(30.81) [laa = &*(kt) [0 = Upat(k) 0) = Tp] 1aa> 
with 
(30.82) Uz = I {1 —o9(k’) [1 — u(k’)] — tog (k’) v(k’)}. 

It is well to keep in mind, however, that the particular deformation of 
the Fermi surface envisaged in the BCS theory is by no means the most 
general such deformation which may lead to a lower energy of the entire 
system as compared to a state with undeformed Fermi surface. In 
accordance with the most general transformation given in Appendix 5 
one can, for example, consider nonstationary deformations which may 
conceivably lead to an even more effective minimization of the total 


energy of the system. In fact, such more sophisticated quasi-particle 
excitations have been considered by Landau in connection with the 
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theory of liquid helium-3, which seems to admit the presence of ripples 
on the Fermi surface, known as ‘“‘zero sound.” In any case, the mathe- 
matical penetration of the problem posed by interacting bosons or 
fermions is far from complete and remains one of the most promising 
fields for the application of quantum-mechanical concepts. 


NOTES 


Einstein [1] discovered the condensation phenomenon now bearing 
his name. The possible connection between permutation symmetry and 
macroscopic properties of liquid helium II was stressed by London [2]. 
The significance of Hinstein’s characteristic temperature in terms of 
countability of particles and the limitation encountered by Maxwell’s 
demon at low temperatures was pointed out by Kaempffer [3]. 

The treatment of the weakly interacting bosons is due to Bogoliubov 
[4]. See also the review article by Beliaev [5]. The treatment of the 
weakly interacting fermions is due to Bogoliuboy et al. [6]. See also 
Bogoliuboy [7] and Valatin [8]. 

The crucial observation that the correlation between pairs near the 
Fermi surface cannot in principle be obtained by perturbation theory 
follows from the work of Cooper [9], leading to the BSC theory of 
superconductivity [10]. 

The theory of Fermi liquids has been developed by Landau [11]. See 
also the review article by Abrikosov and Khalatnikov [12]. 
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'PENDIX 1 


The Eigenstates of Angular Momentum 


The component operators of angular momentum, denoted J,, Js, J’, 
satisfy the C.Rs 


(A1.1) J1J2—-J_J, = iJ (cyclically) 


and are, therefore, mutually incompatible observables. Since 
J* =J;+J3+J% commutes with all operators J;, for example, 


(A1.2) 
P7I3—JI3J* = J2J3+J3J3—J 3 J3—J 333 
= IJ 3J1—tJ 2) +J oJ gJ 2+) 
—(WSiI3+tJ2)J,—(J2I3—tJ1) Jz = 0, 


there should exist simultaneous eigenstates of J? and of one of the 
components J, By convention, a representation will be sought in which 
both J* and J, are diagonal matrices. 

It is convenient to introduce the operators 


(A1.3) Jo =Sit+g; Jt =J1-is, 
They have the properties 

(Al.4) JJé = J? +JI3—Jt 
(A1.5) Jg Io = J?-J,—Jz 
(Al.6) JgI3—J3Jf = St 
(Al.7) TJ 3—J3Jq = —Jo 


as is easily verified by computation with (A1.3) and (A1.1). 

Now denote the eigenstate of J, with eigenvalue m by | ,m> the empty 
space in front of the label m being left open for the quantum number 
characterizing the value of J* in that state, so that 


(A1.8) J3|,m> = m| ,m>. 
309 
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If one multiplies this equation from the left by Jf, 


(A1.9) JT] -m> = mI§| ,m> 
and utilizes the C.R. (A1.6), one obtains the equation 
(A1.10) JTsJs| ,m> = (m—1)Jg| ,m> 


which says that if | ,m) is an eigenstate of J, with eigenvalue m, then 
J§| ,m> is also eigenstate of Js, but with eigenvalue (m—1). In other 
words, the state J{| ,m)> is, up to an as yet undetermined normalization 
constant c, identical with the state | ,m—1), 


(Al.11) J§|,m> =e] ,m—1). 


Similarly, upon multiplication of (A1.8) from the left with J and utili- 
zation of the C.R. (A1.7) one obtains the equation 


(A112) JgJo| ,m> = (m+1)Jo| ,m> 


which allows one to identify, up to a normalization constant d, the state 
Jo| ,m> as an eigenstate of J, with eigenvalue (m+ 1), 


(A1.13) Jo|,m> = d| ,m+1>. 
The operator J) generates thus an ascending sequence of eigenstates 
|,m>, |,m+1), ..., whereas the operator Jj generates a descending 


sequence | ,m)>, | ,m—1), .... 

Each sequence comes to an end, however, for the following reason. If 
| ,m) is a simultaneous eigenstate of J, and J* then, as a consequence of 
(A1.4) and (A1.5), one has 


(A1.14) JoJé| ,m> = (J?+m—m?)| ,m> 
and 

(A1.15) JgJo| ,m> = (J*?-m—m?)| ,m) 
where J? can be treated as a number. Since 

(A1.16) 


<,m|JoJs| ,m> = Js (,m)|Js(,m)> = |e]? ,m—1] ,m—1> > 0 
and =0 onlyif J¢|,m> =c|,m—1>=0 
and 


(A1.17) 
<,m|JgTo| .m> = Jol ,m)|Jo(,m)> = |d|?< ,m+1|,m+1> > 0 
and =0 onlyif Jo|,m> =d|,m+1> =0, 
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Eqs. (Al.14) and (A1.15) mean that 
(A1.18) J?+m—m= > 0 
(A1,19) J?—m—m* > 0. 


For given J* this is consistent only if the descending sequence comes to 
an end for a certain mj, for which 


(A1.20) 
FJ? + Moin — Main = 0 Le. Moin = +4-VIF+} and | ,2Min—1> = 0 
and if the ascending sequence comes to an end for a certain max for which 
(A1.21) 
J*— Max — Max = 0 iC. Mmax = —$+VI7+$ and |, 2max+1> = 0. 


Since m changes by integers, the difference between max aNd Mpjn MUSt 
also be an integer > 0. One may write this 


(A1.22) Mmax—Mmin +1 = 2VI7+F = +l 

where j can have only the following values 

(A1.23) 9 = 04,14... 

Thus the possible values of J® are of the form 

(A1.24) J? = j(j+l) 

and for given j the number m can assume the (2j +1) values between 
(A1.25) Mmox = +53 Min = —J 


The simultaneous eigenstates of J* and J, will accordingly be labeled 
|j,m), satisfying 
F*j,m> = jG+1)|j,m>; jf = 0,4,1,3,... 
|j,m> = m|j,m); m= —j,-j+1,..., +). 


Tfallstates | |j,m> are normalized, then the constants candd are obtained 
by squaring Eqs. (A1.11) and (A1.13), 


(A1.27) 
Je|?<j,m—1]j,m—1> = |e|® = CIE (j,m)|J$(j,m)> = J2+m—m? 
= jG +1)—m(m—1) 


(A1.26) 


o 


(A1.28) 
|d|2<j,m+1]j,m+1> = |d|® = WJo(j,m)|Jo(j,m)> = J2—m—m2 
= j(j+1)—m(m+1) 
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so that one may write, up to an arbitrary phase factor Eqs. (Al.11) and 
(A1.18) explicitly 

(A1.29) Jé\j.m> = ViG+1)—m(m=1)|j,m—1> 

(A1.30) Jolj,m> = ViG+1)—m(m +1)|j,m +1). 


Tf one represents the states |j,m> for given j by unit vectors with 2j +1 
components, so that with a conventional choice of phase one can write 


(A1.31) 
1 


" : 0 7 ‘ 1 \\e ; 
ae Peel | | ibe eet) |i aie 


then by (A1.29) and (A1.30) the only nonvanishing matrix elements of 
J§ and Jo for given j are 

(A1.32) 

(Fé )m—1,m = VIG+1)—m(m—1); — (To)me1,m = VIG+1)—m(m +1) 
In particular, one obtains for j = 4 the representations 

(A1.33) 


d= () B—D=() e=G  H=G o) 


Hus+Id)=5(7 4): Je = G2WE-Id) = 3() 9): 


i 
L/L ON a afl 0 
Ja =53(4 cat a = §(6 i): 


and for j = 1 the representations 


i 0 0 
(A1.34) |1,1) = (\): |1,0> = () ay ('): 
0 0 1 


0” OG 20 0 V2 0 
Jj = (v2 0 ) Jo= ( 0 v3); 
V2 m i fH 


Ji 
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The representation (A1.33) is identical with the one obtained in Sections 
1 and 2 for the spin states and operators, because one has the identities 


|4,4> = |a+> and |},—4> = |a_>. 

If one considers the special case of the orbital angular momentum L of 
an object, whose position and momentum are denoted by Q and P, 
respectively, then J = L can be decomposed according to (12.2) in the 
form 


(A1.35) L = (QxP) 


Working in coordinate representation one can derive (A1.26) again, but 
with the restriction that j = 1 be an integer only. Thus any angular momen- 
tum with half-odd integer value of j must at least partly be due to an 
intrinsic or spin angular momentum that does not permit a decomposi- 
tion of the form (A1.35). 

This derivation is most conveniently carried out in coordinate repre- 
sentation by the introduction of spherical coordinates 7,3, 9, such that 
(A1.36) 

Q1 = rsin? cos 9; Qs = rsindsin 9; Q3 = reosd; 
and 
(A1.37) 
(8/6Q,) = sind cos 9(8/Ar) + cos# cos 9(1/r) (8/00) —sin ¢(1/rsin #) (0/0¢) 
(2/8Qs) = sin#sin ¢(/r) + cos#sin 9(1/r) (0/09) + cos 9(1/r sin ®) (0/0¢) 
(8/@Q3) = cos #(8/Ar) —sin i(1/r) (0/09) 


so that the angular momentum component operators are represented by 


L; = —i(0/0¢) 
A1.38) Lé = L,-iL, = —e-?[(0/09) —i cotan #(0/A9)] 
a Lo = L, +i, = e'?[(8/09) +i cotan9(d/d¢)] 


L? = —(l/sin) (2/29) [sin 9(8/20)] — (1/sin?) (22/292) 


which do not contain r explicitly. It is therefore reasonable to introduce 
for given orbital angular momentum j = 1a function, namely <8, 9|l,m), 
where |#, 9) is the state in which the angular momentum of amount / is 
with certainty aligned in direction #, 9, and denote it 


(A1.39) Yi,m(, ) = (, el, m> 
so that 
(A140) |9,@> = Pp» |l, m> <I, m|9, @> = x |, m> Y#«m(O, 9) 
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and 


(A1.41) 


a ln 


|i,m> = f |, ¢>dQ@, g|l,m> = | f |0, @>sindddde Y,, m(0s 9), 
00 


which subject the functions Y;, ,,(,@) to the normalization condition 


(A1.42) 


win 


<U,m'|lm> = ff YE ws 9) Yim(s 9)sindddde = by18y m- 
00 


Equations (A1.26) can now be written as eigenvalue equations for the 
functions Y;,, »(0, 9), by applying the operators (A1.38) to (A1.41) after 
the fashion of (8.9). One obtains 


(A1.43) —1(0/09) Yi, m(9, @) = MY, m(9, 9) 
and 


(A1.44) 
{—(1/sin 9) (8/48) [sin 9(0/29)] — (1/sin*d) (°/A9°)} Yi, mld, 9) 


= UL+1) Yi, m(d, ¢) 
From Eq. (A1.43) one deduces immediately that 
(Al.45) Yi, m(9, p) = Fi, m(9) e"? 


where now the as yet undetermined function F’,,,,(#) must, according to 
(A1.44), satisfy 
(A1.46) 

(1/sin 0) (0/09) [sin P(OF,, m/O9)] + (UL + 1) — (m?/sin? 3) | F;, », = 0 


with l>m> -l. 


Tf one requires of the solutions F, ,,() that for all 0 <# < 7 they should 
be unique, finite, and differentiable, and also subject to normalization 
after (A1.42) 


ca 20 7 
(A1.47) J J |Yi,m[2sinddddg = 2n J |Fi, n(O)|2sinddd = 1, 
o=0 


o=0 9=0 
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then the general solutions are 


(A1.48) 


| (Qt+1) =m ome (1 a Lead ee 
Yi,m(d, 9) sin, dn(l+m)! amp ainbas ("amaze *™ %) 


i—m differentiations 
(+1) (I—m)! 
4n(l+m)! 


« sin” #(cos' "9+ a cos" 9+...) em? 


Py'(cosd) em? 


where J must be a non-negative integer. Non-integer values of | cannot 
lead to solutions satisfying all the requirements. 
For 1 = 0,1 = 1, 1 = 2 these solutions read explicitly 


(A1.49) 


Yo,0(9; ¢) Soret 


A i 3 
¥i1(8, 9) = [sinoe®: Yi0= | q,00885 
Yaa = ,/esinde®; 


1 
Yoo = YE2= ide sin?pe*?; Yo, = YF4= 5 | Besind coset; 


we me 


Another phase convention used frequently in the literature consists of 
multiplying each Y,,, by (—1)” to yield 


(A1.50) Yi", 9) = (—1)"Yi,m(; 9)- 


NOTES 


The content of this appendix can be found in practically all texts on 
quantum mechanics. Monographs on the subject have been written by 
Edmonds [1] and Rose [2]. 
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PPENDIX 2 


The Addition of Two Angular Momenta 


Suppose two angular momentum operators J, and Jy are given, each 
satisfying J, x J, = iJ, (« = 1,2), so that there exist eigenstates |j,,m,> 
of J2 and J,,; with eigenvalues 
(A2.1) 

Tal GasMa> = Jalja+1)|Jar Mas Ja = 4,14... 


Texs| jas Ma> = Ma| Jers Me? ju > My > —ju; Am, integer. 


Now consider an object whose total angular momentum J can be 
represented as the vector sum of the angular momentum operators J; 
and J, of the two constituents making up the object, 


(A2.2) J=T4+5, 

so that J still satisfies the C.R.s 

(A2.3) JxJ =i. 

There must then exist eigenstates |j,m)> of J* and J, having the property 


au J*|j,m> = j(7+1)|j,m>; jf = 0,4,1,4,... 
“" Js|j,m> = mlj,m>; jg > m > —j; — Aminteger. 

One tries now to represent the state |j,m) as linear combination of the 
direct products |j1,71>|j2,me> in the form 


jt hs 
(A2.5)  |j,m> = Yo LY Oj, jrj23m, my, mz) |j1,M1> |Je,me2>- 
m=th, mts 
The coefficients C of this unitary transformation are called the “Clebsch- 
Gordan coefficients” or, sometimes, the “‘ Wigner coefficients.” This way 
of writing |j,m)> is suggested by degeneracies of the states |j,m> and 
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|j1,m1> |jo,me> with respect to the quantum numbers m and m1, Ms, 
respectively, i.e. the dimensionality of the space spanned by | j,m) is the 
same as the dimensionality of the product space spanned by |j1,7> 
|j2,m2>. 

To see this, label by convention the constituents so that j, > jo. The 
total angular momentum will assume any of the values j for which 
j= ithe jitje—, -.sJi—jet1, ji —Je. For each such value j there is 
a degeneracy 2j + 1,80 that the sum of all degeneracies, i.e. the degeneracy 
of |j,m > with respect to m, is 


(A2.6) [2(j1+5e) ++ (2QQGi+d2—H +H +..-+ 2G -Jet V4 1) 
+[2(j1-Je) +1] 
= (2; +1) (number of brackets [ ]) = (2j1+1) (2j2+1) 
which is equal to the degeneracy of the product state | jum |jo,™2> 
with respect to m1, Mz. 
If the normalizations and phases of |j,,7m,.> are given, then up to an 


arbitrary phase factor the coefficients O are completely determined by 
(A2.4) and the normalization of |j,m)>. It turns out that j, and jz add to 


«aca aac 


Js 


elle alba 


Fic. A2.1. The vector addition of two angular momenta J; and Jz. 


j vectorially (see Fig. A2.1), whereas m, and m, add to m algebraically, so 
that 


(A2.7) m, = M—Ms, 
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and one may simplify (A2.5) by writing 


ih 
(A2.8) \j,m> = Z F C(j,J1sd25 mam —™M) |j1,m1> |jz,m—m>. 


m= +h 

For the purpose of this work it is sufficient to tabulate the Clebsch- 

Gordan coefficients for addition of an angular momentum j,. = } to an 

arbitrary j, (Table A2.1) and of an angular momentum j, = 1 added to 
an arbitrary j; (Table A2.2). 


As an example, consider an object “1” and an object “2” both having 
spin }. According to (A2.6) the degeneracy with respect to m is fourfold, 
TABLE A2.1 
Tue Four Nonvanisuine Corrricrents O(j,j1, 4; m,mi,ma) 


my = m—t my = m+h 
m,=+t me = —} 


jitmthy 2 — (j1—m+4\1/2 
(Gar) Coat) 


g=htt Bitl itl 
adie oh ey 
oe Wari atl 


and one reads from (A2.8) and Table A2.1 the four spin states of the 
object composed of objects “1” and “2”: 


(A2.9) 

j=0: |0,0> = (1/2) (|4, 1/4, —)2—-|4, — ald Do) 
|L,1> = [4,114 De 

p= |1,0> = (/V2) (44> 14, —De+ld, — Hild 2) 


1, == 4, —Pailt —bDe 


Whenever the values j, and j, are equal it is important to maintain the 
labeling of the product states made up out of the objects “1” and “2”, 
as has been done here by the subscripts as in | >, and | >, because the 
transposition operator 7’; , defined in Section 27, operates on these labels. 

One recognizes by inspection the singlet state belonging to j = 0 as an 
antisymmetric eigenstate of 7’). and the triplet of states belonging to 
j = 1 as symmetric eigenstates of 7’, >. 

As a second example, the states resulting from the addition of two 
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(1+ Vee [ (1+ 1e)te | 5, 
a/t lees +u+tl) a/t 6), 


(T+ He) 
a/t la +w—Tl)(w—T6) 


(ue + 1L)(u — Te 
(+t) l(t + TO) 
al ++ Tl)(m—Tl) u+ 
(T+ TT +1) 


(+The a 
onl ui —TA)(w+ 10) 


(e+ Te)(t +t) 


co 
w= 
LHF 


[ (e+ tert) 
et 


(T+w—T)(w—t6) at l(t ut TOT +ue— Tl) 


at (Tut te)(ue + 16) 


I- = tw ou 
tw = tw uw = tw 


o 
i 


Bue 
Tu 


(Tu ‘we $1 00) 9 SENHIOIMATOD ONTHSINVANON ANIN SHI, 


Tév ATAVL 


The Addition of Two Angular Momenta 321 
angular momenta j; = 1 and j, = 1 will be recorded here. In this case, 
Table A2.2 becomes applicable to (A2.8), and one has 
(A2.10) 
j=: 0,0> = (1/3) (1, Dy 1, —12—|1,0>1]1, 2 

+1, -1, 1, 19) 
1, 1> = (1/V2)([1, 1), |1,0>2—|1, 01 |1, 1») 
j=l: 1,0> = (1/V2)(|1,1>,|1, —De—[1, —1)1[1, 12) 
1, —1> = (1/72) (|1, 093 |1, —12—|1, — 11], 02) 
2,2) = 1, 1);|1, by) 
2,1) = (1/72) (1, 1>1|1, O>2+]1, 0, [1, 12) 
2,0> = (1/76) (|1, 11 ]1, —1>2+2|1, 011, Oe 
Mt 1, -), 1, 12) 
|2, —1> = (1/¥/2) (1, 0>1|1, —L2+]1, —1)1|1, 02) 
{]2,—2> = |1,-—Dall, -—De 


Once again, inspection reveals the transformation properties of these 
spin states under transposition 7';» of particle labels: The triplet of states 
belonging to j = 1 are antisymmetric, and the singlet state belonging to 
j = 0, as well as the quintet of states belonging to j = 2, are symmetric. 


NOTE 


The classic source for Clebsch-Gordan coefficients is the work of 
Condon and Shortley [1]. 
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APPENDIX 3 


Vector Spherical Harmonics 


By taking in (A2.5) 7; = and j, = 8 = 1, one obtains for each j three 

different angular momentum states, 
-l -t 
(A3.1) |j,m3lse=D= ¥ ES Cj, 1,1; m, my, m,) |t, m> |1, m,> 
m= +l m=+1 

where m = m,+m,, corresponding to the three possibilities 7 = 7+1,1,J—1 
orl=j+1,j,j—1. The spin states belonging to s = 1 can be represented 
by ¢ functions which transform as vectors, for example, the eigenstates 
of ss [see Section 18, Eq. (18.68)], namely §1, €°, €-!, whereas the orbital 
states can be represented by spherical harmonics which depend on n, 
the direction of momentum, so that the total 4 function of an object of 
spin 1 and energy w must be a linear combination of three “vector 
spherical harmonics” 


(A3.2) 
=I -1 
Yjim(n) = z, x 1,1; m,m, ms) Yi,m(n)B™; = 1m = m—M,; 
m= ny= 


corresponding to the three possible values of /, namely j +1, 7, 7-1, and 
may therefore be written 

i 
(A3.3) fiyim(n) = >») ¥ @,(@) Yim (1). 


taj+ 
For photons, the expansion coefficients a,(w) are restricted by the 
condition of transversality (18.72), which reads here 
(A3.4) Dfyjm(n) = 0. 
The vector n can be represented by its components n“ defined by 
=1 
(A3.5) Das >) inher. 
u=+1 
With the representations (18.68) one finds for n# in terms of the cartesian 
components 7; = sin#cos 9, N: = sind sin p, Ns = cosd, 


ntl = —(1/*/2)(n;—ing) = —(1/V/2)sinde“? 
(A3.6) n> = ng = cos} 
n-* = (1/2) (ny+ine) = (1/2) sin de’. 
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Now from the three vectors Yj,(m) one can construct three linearly 
independent combinations, labeled ves (A= +1,0,—1), of which the 
first two are transverse, i.e. satisfy 


(A3.7) n-¥i,=0 for A= +1,0, 
and the last one is longitudinal, i.e. satisfies 
(A3.8) nx Yj = 0. 
These linear combinations are, with proper normalization, 
J j+l 
(Aa.8) Yin = fp tsnet [Fe id 
(A3.10) AG ae in Gare 
i; j j+1 
(43.11) Yj = bape terum— fF tase A 


and one has, by solving these equations for Y; , ,,, the following decom- 
positions of the vector spherical harmonics (A3.2) into longitudinal and 
transverse parts, 


(A3.12) Vij = ven Y#i- Vj+1Y7A) 
(A3.13) Yjj,m = Yn 
(A3.14) Yj jm = yeni =i (Vj+1Y fn + V5 Xin). 


The # function of a transverse photon with definite angular momentum 
and energy will therefore be, in general, a linear superposition 


(A3.15) fim = 4 V$h +40 Vn 


with coefficients a, and dp» subject only to the normalization condition 
|ay|* + |ao|* = 1, and depending only on w, a, = aa(w). 

The arbitrariness in the choice of the parameter \ is removed if one 
requires that a photon state have a definite parity. From the transfor- 
mation property of the spherical harmonics Y;,,,(n) under inversion of 
coordinates, 


(A3.16) TTY, m(0) = (—1)¥i,m(0) = Yi,m(—2) 
and the vector nature of § in n space, 
(A3.17) TE = -& 


follows that 
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(A3.18) TTY jm() = (—1)** Xm (1) 
and the transverse and longitudinal parts transform as 
(A3.19) Ty}, = (—1)"*'y},,. 


Transverse photon states belonging to A= +1 are called ‘electric 
multipole” states and have parity (—1)/, whereas transverse photon 
states belonging to A= 0 are called “magnetic multipole” states and 
have parity (—1)'*!. States belonging to j=1, 2, 3, ete., are called 
dipole, quadrupole, octopole, etc., states. This terminology derives from 
the asymptotic behavior of the corresponding % functions in coordinate 
space which resembles that of the classical fields emitted by the respec- 
tive multipoles. For details the reader is referred to the references at the 
end of this Appendix. 

It will be noticed that for j = 0 there exists only one vector spherical 
harmonic, namely Yo j,9 = —Yo,, which is necessarily longitudinal. 
This means there exists no single transverse photon state of total angular 
momentum j = 0. 

As has been shown in Section 27, two-photon states of odd parity are 
associated with the antisymmetric spin states (A2.10) |s =1;m,> and 
states of even parity belong to the symmetric spin states |s = 0,2;m,). 
The transversality condition requires for each photon that 


(I) if the spin states |s = 1;m,) are represented by a vector spherical 
harmonic, then this must be the longitudinal Y~', and 
(II) states of even parity must be represented by linear super- 
positions of spin states |s = 0;0> and |s = 2;m,>. 


Because of the three-component nature of spin vectors, the ¢ function 
associated with a two-photon state can be represented by a tensor in 
spin space, denoted [/(*)],,., Where «, and « are spin labels of the 
individual photons, and x refers to the relative momentum of the two 
photons, which is the only remaining variable after the center of mass 
momentum has been separated. In this notation, the transversality 
condition for each photon may be written 


(A3.20) (ar, LF (2) Jo oe = [A (2) Jer ors (*)ay = 0- 


Now, for two-photon states of odd parity, the spin states are anti- 
symmetric, which means /,,,, is a skew tensor in three dimensions, and 
therefore representable in the form 


0 A, —A;,) 
(A3.21) Se pe 0) eds, ) 
Agnes 
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with an axial vector A, allowing one to write the transversality condition 
(A3.22) nxA=0. 


This means the vector A representing the antisymmetric spin state of 
two photons must be longitudinal, and therefore, for given j and m, 
according to (A3.11) 

7 7 
(43.23) A=YR= Be teuen AE sue 
This function has odd parity only when j is even. Therefore, there exist 
no two-photon states of odd parity if j is odd. In the language of selection 
rules, one can thus say that a particle of odd integer angular momentum 
and odd parity cannot decay into two photons under conservation of 
these quantities. 

For two-photon states of even parity the spin states are symmetric, 
which means /¢%? is a symmetric tensor in three dimensions, having six 
linearly independent components, corresponding to the six different 
values of 1 possible for given j and m, namely 1/=j,j+1,j+2 for s=2 
and / = j for s = 0. Since the parity (—1)' is specified to be even, however, 
for even j > 2 only four components can be unequal to zero, namely 
1=j, j+2 for s=2 and /=j for s = 0, whereas for odd j > 3 only two 
components can be nonvanishing, namely |=j+1 for s= 2. Special 
cases are j = 0, when there exist only two components, corresponding to 
1=2 for s=2 and /=0 for s=0, and j =1, when there is only one 
component, corresponding to / = 2 for s = 2. Among these, the trans- 
versality condition causes further restrictions. For even j > 2 there are 
only two transverse states and for odd j > 3 there is only one transverse 
state. In the special case j = 0 there is only one transverse state, and for 
j = 1 there is no transverse state. 

Further details may be obtained from the references listed below. 
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ee APPR NUIS 4 


The Invariance of Dirac’s Equation under Lorentz 
Transformations 


Dirac’s equation, in momentum and energy representation 


(A4.1) (ysw—yk) A = mA 

ean be written, with the conventions 

(A4.2) A,B, = A,B,— AB; w=k, 
in the form 

(A4.3) (yy ky—m) A(k,) = 0. 


Requiring that the matrices y are the same in all coordinate frames, 
eovariance of Dirac’s equation means that if (A4.3) holds, then 


(A4.4) (yy ky —m) A'(keh) = 0 


should also hold, where the connection between A‘(k\) and A(k,) is 
established by a unitary operator U in spin-chirality space, 


(A4.5) A'(k)) = UA(ky); — A(hty) = U-* A(R) 

to be determined from the Lorentz transformation 

(A4.6) ky = Lyk; hy = Lyk 

which leaves 

(A4.7) Kk = kk, = m?® 

invariant. This means that 

(A4.8) Lyx Ly = 8yv5 Lyn = Lys Lyn LH = Sy: 
Note the convention 


(A4.9) 
-1 for p=v=1,2,3 


8 = 7 +1 for p=v 


i 
» 


so that A, 8uy = Ay. 
0 otherwise 
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Now write Eq. (A4.3) with the help of (A4.4) and (A4.6) as 

(A4.10) 

(y, Lah ky 0-1 —mU-") A'(kh) = 0 ine. (Ly, Uy, U-* ky —m) A’ (ke) = 0. 
This is identical with (A4.4) provided 

(A4.11) Ly,Uy,U-! = y, or Uy, U = Lay yy. 


This equation is true for all Lorentz transformations, including the 
improper ones. The main interest here lies, however, in the proper 
Lorentz transformations, i.e. those whose determinant is +1 and for 
which L,, > 0, so that they may be thought of as evolving continuously 
from the identity transformation, and do not contain transformations 
reversing the sign of the energy. 

Now consider the case of the infinitesimal transformation, 


(A4.12) ky = kate ky with €xy = —ya 
which guarantees (A4.8). Thus 
(A4.13) 


Dy = 8yytery and Lyy Ly = 8yy8yu + 8dr eq + Ory €dv 
= byt eptepy = Oyp- 
A solution of Eq. (A4.11) will now be sought by putting 
(A4.14) 
U =I+(i/2)eyM,, with M,, = —-M,, 
where the M,,, are 4x 4 matrices numbered by the pair of indices (i.e. 


M,, is not a matrix element). There are thus six independent matrices 
M,,. Alternatively one may write 


pe 


(A4.15) U = I+te, 8, 

with 

(A4.16) 

€1 = €28 €2 = €31 €3 = C12 €4 = €14 €5 = €2¢ €6 = €94 


8S, = Mog S, = Mg, S3= My, Se= My Ss = Moy So = Mau 


Substituting this into Eq. (A4.11), keeping only terms linear in ¢,,, one 
obtains 


(A417) 
(#/2) €yly¥n Muy — My ya) = rv ¥v = B€yv(Buavv—9va Yu) 


The Invariance of Dirac’s Equation 329 


Tt is seen that, up to terms which commute with all y,, M,,, must satisfy 


(A4.18) Uy, Myy—Myyya) = Surv — Sa Yp- 
This is obviously solved by 
(A4.19) 
Myy = —(/4) (Yurv—Yv Yu) = Rams 
0 for p=v. 
Indeed, 
(A4.20) 


RAY pve Va) = HAY ere + YpVAaYy— Baru) = Sur%—Syr Yu» 


The C.R.s between the M,,, can now be established by straightforward 
computation: 


(A4.21) 

[Mids My) = 18x Marv + 8rv Miu — Bap Mier — Sey My): 
Writing 
(A4.22) [Mas Myr) = 4C ed, wv, po Mp0 


(the factor } is needed because the summation over p and o counts each 
term twice) one has for the structure constants C,.), ny, po 


(A4.23) 

Cyd, pr, po = US; 8pr Sov + Bry 8 pre Sop — Bau Spx Sav — Sev 5 pd Soy) 
so that 
(A4.24) 

trace (Mg My3) = Cup, uv, po Cpo,pv, 8 = 4(dxy 8as — 8x8 dpy)- 
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APPENDIX 5 


The Most General Canonical Transformation of a Pair 
of Fermion Operators 


Any unitary operator U is of the form 
(A5.1) Uses; Siais* 
so that any operator A transforms as 
(A5.2) A = & Ae’ = A+i[SA]+(i?/2!) [S[PSA]]+... 


Consider now two fermion operators « and 8, which together with their 
hermitean adjoints satisfy the anti-C.R.s 


(A5.3) fata} =I; {pt p} = I; all other { } = 0. 


For fermions which are completely specified by a momentum label k and 
a spin label s, the operators « and 8 may, for example, be identified with 
annihilation operators a(k,,s,) and a(ks,8,) provided the two sets of 
labels (k,,s;) and (ky,sy) are not identical. The anticommutation pro- 
perties of « and 8 allow one to form not more than eight linearly inde- 
pendent products of even order in the operators «,«*,8,8*, namely 


(A5.4) 
Pee mah Oh pe Bo et = at 8; vp = Bra; 4, = ate; 
ng = BYB; p= atBtoB = —n,ng = —ptp = vty —ng. 


These operators commute with any other such set made up out of 
operators «’, 8’ belonging to a different set of fermion labels (k’,s’). 
Therefore, the most general hermitean operator S which will guarantee 
that the various operators U,U’ belonging to different sets (k,s), (k’,s’) 
commute, will be of the form 


(A5.5) 
S = Cote:pt+cop+egvt +cqv-+C5 My +Cymp+Crp 
where the coefficients c; must still satisfy certain reality conditions so that 


S* =S as required by (A5.1). 
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This representation can be simplified if new linear combinations are 
introduced, a simplification to which one is led by the following theorem 
of Koppe and Mihlschlegel: 

Let A be an operator such that 
(A5.6) A*=0 and AAtA=A 
then the operators 


(A5.7) 

2; = At+A; 2, = i(A+—A); 2, = AAt—AtA; 2X) = AtA+AAt 
satisfy the relations 

(A5.8) 

2,3; = 12, (i,j, koyel.); ZF = 2%; 22) = 23, = 2; A= Jp. 
The operators (A5.7) are thus isomorphic to the Pauli matrices and the 
unit matrix. Note, however, that 2) need not be identical with the unit 
operator J. 


The premises of this theorem are satisfied for A = » and _A =v. There- 
fore the following sets of operators satisfy separately the relations (A5.8): 


(A5.9) oy = petty; 
og = i(ut—p); 
og = wet —pt pe = I-ng—ng; 


oo = et pt+ppt = I—n,—ng+2n, ng 
(A5.10) 1 = vt ty; 
T2 = i(vt—y); 


pe are es 


T3 = wh—vty = ng—N,; 


To 


vtv+wt = n,+ng—2n, ng. 
One has, in addition, the relations 
(A5.11) ott, =I; AT) = T9% = 0 


so that, because of (A5.8) all products between operators o and operators + 
vanish. 


The most general unitary operator can therefore be written 
(A5.12) U = ete git otters piso 


where (to,t, 89,8) are eight real coefficients. 
The C.R.s between the operators o, 7, and the various product operators 
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made up out of «, 8, «*, B* allow one to sum the series (A5.2), representing 
the various transformations, in closed form. By expending some labor 
one finds, for example, the following general transformation formulae: 


(45.13) 
& = A, AD + 2iCng) «— A, BD + 2iCn,) B 
— B, A(D* — 2iCn,) B* — B, By D* — 241Cng) act 
(A5.14) 
B = A, BR(D + 2iCng) «+A, Af(D + 2iCn,) B 
— B, Bi(D* — 2iCn,) B* + B, Aft (D* — 210g) a+ 


where 

(A5.15) A, = cos|s| +(isg/|s|)sin|s| = A*, 

(A5.16) B, = [i(81+¢82)/|s|]sin |s| = —B_, 
(A5.17) C = O(s9—to) = sin (to—So) C(to — 80) 
(A5.18) D = D(s—to) = ef) = D*(ty—89). 


The transformation of Bogoliubov [Section 30, Eqs. (30.66)] is con- 
tained in (A5.13) and (A5.14) as a special case. By putting t = 0, tp = 0, 
89 = 0, 8; = 83 = 0 and letting only s, 4 0 one has A, = cos(s2), A, = 1, 
B, sin (s,), B, = 0, C = 0, D =1, so that 
(A5.19) & = COS (Sy) «+sin (8g) BT 
(A5.20) B = cos (sy) B—sin (82) «+ 
This transformation coincides with Bogoliubov’s transformation if one 
makes the identifications 


(A5.21) 
a=a(k,1);  B=a(—k,2); — w(k) = cos(s.); (kk) = —sin (sy) 
with the understanding that s, depends on |k| only so that the relations 
(30.68) are satisfied. 

Another special case contained in the transformations (A5.13) and 


(A5.14) is the operator (27.49) representing the transposition of particle 
labels. The conditions reducing these transformations to 


(A5.22) &@&=B and B=a 

are obviously 

(A5.23) 

C=0, D=1, B,=0, 4,=0, —A,B, = A,B =1, 
or A, = B, =. 
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They can be satisfied by putting, 
(A5.24) 

8S =to=7, 8 = 8 = 0, 88 = 7/2, ty = 7/2, to = ty = 0. 
Then [using (A5.9)-(A5.11)] 
(A5.25) 

U = elitr gilml2) 7 gimae giln/2) ox — gim(rvta») pilm/2) 7: gilm/2) ox 

—i exp [i(er/2) (art B+ B* «)] exp [i(m/2) (Im —ng)] 
coincides with the operator 7’, (27.50) if one makes the identifications 
(A5.26) a(l) = « and a(2) = B. 


It is often convenient to have an explicit expression for the operator U, 
which can be evaluated by using 


(A5.27) 
(so)? = 8709; opG¢=6; of = 09; (tt)? =? 79; mor =; T=T9. 
One has 
(A5.28) 
ef = 1+ (i?/2!) 8? a9 + (14/4!) 8409+... +%(sa) + (19/3!) 8%(so) +... 
= 1—o9(1—cos |s|) +¢[(se)/|s|] sin |s| 


and 
(A5.29) 
eit = 14 i890q+ (12/2!) s8og-+... = 1—o9(1—e%) 
and completely analogous expressions for e“* and e“7*, so that 
(A5.30) 
U = {1—r9(1—e™)} {1 —79(1 — cos |t|) + 4[(te)/|t]] sin |t]} 
x {1 —o9(1—e")} {1 —o9(1 —cos |s|) +4[(se)/|s]] sin |s]}. 


Bogoliubov’s transformation is obtained from this as the special case 


(A5.31) —sin (82) = v(k); 08 (82) = u(k) 

with all other s,¢ vanishing: 

(A5.32) Uz = 1—o9[1—Uu(k)]—io, v(k). 
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——_—— KK __________ APPENDIX 6 


The Delta Function and its Application to Phase Space 
Considerations 


For a scalar variable « the delta function 5(«) is defined as the integral 


+0 
(A6.1) 8(«) = x i ei dt, 


(se) 


fig 
8 


Fig. A6.1. The function [sin (@7)]/7« for large T’. 
For the purpose of visualization (see Fig. A6.1), this may be thought of 
as the limit 

rT 


+ 
(A6.2) 8(«) = lim = J edt = lim any 

To 27 
-7 


It has the properties 


Cy 


+ + 
(A6.3) g(a) = J g(a’) 8(a—«")de’ and J 8(«) do = 1, 
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Alternatively, one may think of 5(«) as the limit (see Fig. A6.2) 


7 (sin(o 2)" 
Fat (aP/2) 


2 
Oo ¥ 


Fra. AG.2. The function [7/27] (sin 2(o7'/2)/(a7'/2))? for large T. 


+7/2 

eu al? a. 2 (sin (aL]2)\2 

P= saci vio = Pe bscuns, \reath Lad.)| |e 

tiie an ol 7 ) sila es i (eT) ) 
—T/2 


Comparing (A6.4) with (A6.2) one finds the relation 


l +0 
(A6.5) [3(a)* = 5-3(a) ; J dt. 
For a four-vector P one defines similarly 

1 i 

(A6.6) iP) = Bayt i elt d4x = 8(P)8(Po) 
and has 

1 
(A6.7) [8(P)]? = amy dP) f ate. 


Of great usefulness for applications of the 5 function is the following 
general formula governing the 6 function having a function /(«) as its 
argument: 


XD [8(a—a,)] 
(A6.8) 8[f(«)] = ~“Taiday 


where «, are the simple roots of f («) = 0 in the region under consideration. 
To prove this divide the region of integration into small intervals so 
that in each interval there is only one root of f(«) = 0. For the root «,, 
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in particular, let this interval be «,—«, < « < «,+¢,. One has then in this 
interval the expansion [using f(«,) = 0 by definition, and f’(«,) 4 0 if «, 
is a simple root] 

(A6.9) F(a) = flere) = f'(op)e+ (1/2) f"(a)eP +... 

and thus 


arte +f'(arler+.. 
(A6.10) i} g(x) S[,f (a) dex 


ar — Er —S(arlert.. 


a(fdf 
ate \ a flde: 


Io) p 
F(a) for f'(a%) > 0 


- for f’(a,) < 0 


from which the formula (A6.8) follows by summation. Equation (A6.10) 
is based on the understanding that ¢, is chosen such that the signs of the 
integration limits are determined by the term linear in ¢,. 

If one takes, for example, f(«) = «*—c® so that there are two roots 
«= +c, one has the special case 


(A6.11) 3(a2—c2) = ae 2 ¢) +8(a-+c)]. 


As an application consider the Compton effect and describe it in a 
laboratory frame of reference in which, initially, the electron is at rest, 
k=0, Q=m, and one photon of energy w = |x| is present. With a 
normalization of states to one particle in a volume V, w,,V is the 
transition probability per scattering center and unit time, where w,-, is 
defined by Eq. (23.22). The incident intensity J in photons per unit area 
and time is equal to the number of incident photons per unit volume 
(namely, 1/V with the normalization adopted above) times the velocity 
of light (namely 1 in natural units), giving J = 1/V. Thus the scattering 
cross section per scattering center into a fixed final state 7’ is 


(Wy, ViL) = wy, V*. 


In any actual experiment one rarely discriminates a specific final state 
7’, but rather a set of final states with final momenta of electron and 
photon ranging between k’ and k’ + dk’, and x’ and x’ + dx’, respectively. 
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The number of such states, for definite spin and definite polarization, is* 


(A6.12) dN = [V?/(2z)®]dk' dx’. 

The cross section per scattering center and unit time into this set of final 
states is therefore 

(A6.13) do = wy, V7dN. 

If the detection apparatus responds only to photons scattered into a 
solid angle dS oriented in direction #,, one has to integrate over dk’ 
and dw’, keeping # and constant. Since dx’ = w'*dw'dS one obtains 
for the cross section per scattering center encountered by photons 
scattered into dS 


(A6.14) odS = [V4d8/(27)°] J J wy dk! wo! dos’. 

Since w,,, contains a 8 function 6(P’ — P) = 5(k’ +x’ —x), the integration 

over dk’ can be carried out immediately, resulting in replacement of k’ 

everywhere by 

(A6.15) k’ = x-»’ 

in accordance with conservation of momentum. As a consequence of this, 

the energies Q, Q', w, w', appearing in the remaining 5 funetion 
8(Pp—Po) = 8(Q’ +w’ -—Q—-w) 

become interdependent, and this has to be kept in mind when the integra- 

tion over dw’ is carried out. Writing 

(A6.16) (Q' +w'-Q-w) = f(w') 

and using Eq. (A6.8), one may write (A6.14) as 

Vids |<r'|R|7>|? w!? 8(w’ — w;) dw’ 


Ola). 088 = Gant [taf (o"y/do"D 
_ Vids |<r'|R|7> |? wo’ 
. Cay ay eT) wee 
The “resonance value” w, of w’ follows from 


(A6.18) 
Lf (@'Yurmey = (2! +0,- Qe) = [VP +k? +0,-—m—a] 


* If V is a cube whose sides are of length L, then possible values of k are k; = ,(27/L) 
with n, integer. The number of integers whose corresponding vectors k lie between kand 
k-+dk is equal to (L/2)8dk,dkedkg =[V|(27)%]dk. Thus any sum (1/V) X-+. can, in 
the limit V—+©, be replaced by an integral [1/(2z)3] f ...dk. 
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where # is the angle of scattering between x and x’, so that k’? = (x —x’)® 
= «> +K'?— 2k’ cos) = w? + w'®— 2ww’ cos#. The result is 


mw 


(A6.19) Or = m+ w(1 — cos) 


and one has also 
(A6.20) 


[flo /de'Iarnea, = ( 


7 


w’—awcost ) mu 
- Zz 
w= wy" , Wy 


Vm? + w+ w'2—2ww’ cosh 
where 2; is the energy of the scattered electron at resonance, 
(A6.21) Q = (Vm? +k"), 20, = Vm? + w® + 07? — 2ww, cos). 


The expression for the differential cross section (A6.17) can therefore be 
cast in the form 


(A6.22) odS = [V*dS|(2n)'9] |<r'|R|7>|2 [e078 2 [mo] 


with the understanding that the matrix element has to be evaluated with 
the resonance value (A6.19) and (A6.21) for the energies in the final state 
and under observation of conservation of momentum (A6.15). 

Finally, if the target and the incident photon beam are unpolarized, 
and if one does not observe the polarizations of either scattered electrons 
or scattered photons, the differential cross section must be averaged over 
initial spins and polarizations, and summed over final spins and polariza- 
tions, giving (since only transverse photons are involved) 


(46.23) <odS) ay = 


NOTES 


The 6 function was introduced into physics by Dirac [1]. For a more 
detailed description of the 6 function see Iwanenko and Sokolow [2]. 
A complete treatment of the Compton effect is contained, for example, 
in Mandl [3]. 
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APPENDIX 7 


If Galileo Had Known Quantum Mechanics 


Galileo would presumably have been keenly interested in the trans- 
formation properties of the time-dependent ys function Y(q,t) of a free 
particle of mass m, satisfying 


(A7.1) i(aV/at)—(1/2m)V?¥ = 0;  V = (d/0q), 
under the general transformation bearing his name, 
(A7.2) q>q’ = Rq+Vi+a; t+t' =t+b, 


where R is a constant orthogonal matrix representing a spatial rotation, 
V a constant vector representing a pure Galileo transformation, a a 
constant vector representing a displacement in space, and 6 a constant 
representing a displacement in time. 

Denoting the set of numbers (f,V,a,6) by G, the transformation 
(A7.2) should be representable by a unitary operator U(G) so that, up 
to some phase factor, |q(t)>’ = U(@) |q(t)> = |q’(t’)> is the transformed 
state. Accordingly, the state can be characterized in the transformed 
frame by a y function ¥’(q,t) which differs from the untransformed 
function taken at the transformed point at most by a phase factor, 


(A7.3) W'(q,t) = FO Wq',t’). 
Invariance under Galileo transformations means Y’ and Y must satisfy 
the same Schroedinger equation. Thus 
(A7.4) i(0Y” Ot) —(1/2m) V2? Y" = 0. 
This equation imposes conditions on the phase function f. Using the 
relations, following from (A7.2) and the orthogonality of R, 
(A7.5) 
(d/at) = (8/dt’) + V-V'; Vv = RV’ so that Ve = Vt 
one can write (A7.4) upon substitution of (A7.3) 
(A7.6) [—(0f/at')—V-V' f+ (1/2m) (V' f)?— (i/2m) V2 fel Y 
+i[V—(1/m)V' fletV' ¥ 
+ [i(O/dt') —(1/2m) V2 le" = 0. 
341 
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The last term vanishes on account of the Schroedinger equation (A7.1) 
(which must hold for any q and ¢), and since Y and V’ ¥ are linearly 
independent, one has the two conditions 

(A7.7) Vif = mV 

(A7.8) (Af/dt’) = —V-V' f+ (1/2m) (V' f)?—(i/2m) Vf. 
Substituting (A7.7) into (A7.8) and keeping in mind that V is a constant 
vector, one finds 

(A7.9) (0f/at’) = —kmV?. 

Equations (A7.7) and (A7.9) can be integrated immediately, yielding 


(A7.10) 

S(q',t') = mV-q'—4mV?t' +0 (C a constant). 
The remarkable conclusion to be drawn from this result is that the phase 
factor f cannot, in general, be eliminated by judicious choice of the integration 
constant C. This has a profound consequence, first noticed by Bargmann, 
namely: 

It is impossible to have in nonrelativistic quantum mechanics states which 
are linear superpositions of states describing particles of different masses. 
This means one cannot grasp in nonrelativistic quantum mechanics states 
with a mass spectrum, or states describing unstable elementary particles. 

To see this, consider a linear superposition 


(A7.11) Y= P+, 
where Y, and Y, transform according to (the constant phase C has been 
put equal to zero for simplicity’s sake) 
(A7.12) 

(qt) = {exp [tm,(V-q'—4V*t')]} ¥.(q',t'); (w= 1,2). 
Now perform the following sequence of transformations, amounting to 
the identity, 


(A7.13) 
G, = G,4;,4,G4, = (I, —V,0,0) (I, 0, —a, 0) (Z, V, 0, 0) (I, 0, a, 0) 
= (I,0,0,0) 
corresponding to a sequence of coordinates and velocities 
(A7.14) 
qs = 45 Qs=a+Vi; g2=q+Vit+a; gqi=qta 
t,=¢; ts =¢; te = t; hat 


Vv, = —V; V; = 0; Vo = Vs Vi = 0 
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giving rise, for each m,, to a sum of phases 
(A7.15) 


4 
My Z (Vy 4054) = malV-(4a— 40) — V2e] = Mg Voe 


so that the transformed y function becomes 


(A7.16) 
wT — U(G,) = [exp (im, V-a)] ¥i+[exp (im, V-a)] Po. 


This means, a transformation amounting to the identity can affect the 
norm of the superposition (A7.11). In other words, the relative phase of 
two # functions describing particles of different mass is completely 
arbitrary if one demands Galileo invariance. To avoid inconsistency one 
must conclude that a superposition of the type (A7.11) is without mean- 
ing, and that there can exist no operators which sponsor transitions 
between states characterized by different masses m, and my. This amounts 
to existence of a superselection rule which guarantees the strict conservation 
of mass in nonrelativistic quantum mechanics. 

It should be noted that this conclusion is not valid in relativistic quantum mechanics. 
Consider, for example, the equation governing the y; function of a spinless particle 
(A7.17) D2p—m2y = 0 
which is invariant under the inhomogeneous Lorentz transformations 


(A7.18) ax’ = Lat+u 


where L is a constant Lorentz matrix and u a constant four vector. Demanding that 


(A7.19) yi (x) = el f(a’) 
satisfy the same equation 
(A7.20) Oy’ —miy’ = 0 


leads, because of 

(A7.21) O=L0' sotht O?=0%, 

to the equation 

(A722) [i0®g—D' gO’ gle p+ 2101 gD pe” +102 p—m? ye” = 0. 

The last term vanishes on account of (A7.17), and the remaining conditions are 
(A7.23) O'g = 9; i0”"9-O'9O'g = 0, 

which require 

(A7.24) g = constant. 


It is perhaps instructive to set down here the alternative treatment of 
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the Galileo transformation in momentum-energy representation, in 
which a free particle of momentum k and energy w is characterized by the 
¥ function 


(A7.25) @(k,w) = J ellot—k-al Yq, t) dq dt 
satisfying the Schroedinger equation 
(A7.26) [w—(k?/2m)] Bk, w) = 0. 


The transformed function is then given by 


(A7.27) ®'(k,w) = J ellwt—k-al W'(q, t) dq dt. 
Using 
(A7.28) q = R-\(q'—Vt' +Vb—a); t=t'-b, 


and the transformation formulae (A7.3) and (A7.10), the integrand can 
be expressed in terms of the transformed coordinates q’,t’, 


(A7.29) 


©'(k,w) = J exp {i[w(t’ —b) —k- R-4(q/ — Vt’ + Vb—a) 
+mV -q'—4mV?t' +C)} Pq’, t’) dq’ dt’. 

Extracting the terms not containing q’,t’ from under the integral, 
dropping the primes on the integration variables, and using the ortho- 
gonality of R by writing k-R~'a = Rk-a, etc., one obtains 

(A7.30) 

©'(k, w) = exp[i(—wb + Rk-a—bRk-V+C)] J exp [i(w’t—k’-q)] 
W(q,t)dqdt 


with 
(A7.31) w! = wt+V-Rk+4mV?; k’ = Rk+mV. 


The quantities w’ and k’ can be called the transformed energy and 
momentum, because they satisfy 


(A7.32) w'? —(k'?/2m) = w?—(k?/2m). 
Expressing w and k in terms of w’ and k’, 

(A7.33) w = w'—V-k'+4mV?; k = R-'(k'—mV), 
one can write the transformation formula (A7.30) 


(A7.34) 
®'(k, w) = exp {i[(b/2)mV*2—mV-a+C]} exp [i( —bw’ +a-k’)] O(k’, w’). 
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As in case of the coordinate representation, the phase factors cannot be 
eliminated entirely by judicious choice of the constant C. The sequence 
of transformations (A7.13), corresponding to 


(A7.35) 
k, = E; ks = k+mV; k, = k+mV; k, =k 
a, = 0; ag = —a; a, = 0; a =a 
by = 0; bs = 0; b. = 0; 6, =0 
give once again, for each m,, rise to a change in phase 
(A7.36) Daj k; = a,-k, +a3"ks = —m,a-V 
J 


leading again to Bargmann’s superselection rule. 

A representation for the generator u of the pure Galileo transformation 
characterized by the parameter V, so that in momentum-energy 
representation 


(A7.37) |k, w>7 = |k’,w’> = U(V)|k,w> = eV "|k, o>, 
can be obtained by demanding, in accordance with (A7.31), 


(A7.38) 
k’ = ¢k’,w'|P|k’,w’> = <k,w|P|k,o>+mV = k+mV 

and 

(A7.39) 

w' = ¢k’,w’|A|k’, w’> 


dk, «| H|Ik, w) + Ck, o|V-P|k, o> +4mV2 
w+V-k+4mV*. 


Using the expansions 

eWVupevu — P_i(V-[u)P]+... 
(A7.40) 

eu eV — H—i(V-[u) A] + (2/2!) (V-[u) (V-[u) AyJ—- +... 
one has then the requirements 
(A7.41) —i(V-[u)P] = mV 
(A7.42) 

—i(V-(u)H] = V-P; —(i/2) (V-[u)(V-P)] = $mV?. 

The second equation (A7.42) is identically satisfied if (A7.41) holds, and 
for the unknown operator u there remain the C.R.s 
(A7.43) (u;Py] = ims; (uH]] = :P, 
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which can be solved by putting 
(A7.44) u = im(0/dP) +iP(0/0H). 


The general Galileo transformation is thus associated with the 10 
operators 


J=L+8=(QxP)+S = —i(Px 0/dP)+S8, of total angular momen- 
tum, generating the rotations R, 


u = im(0/dP) +iP(0/0H), which can be identified as coordinate operator 
Q = mu, generating the pure Galileo transformations V, 


P, of linear momentum, generating the displacements in space a, and 
H, of energy, generating the displacements in time b. 
They satisfy the C.R.s 


(A7.45) 
[Jide] = WJ (cyel.); [Jiuz] = tug (eyel.); 
[uj %] = 0; 
(iP2] = iP; (oycl.); [Je H] = 0 
(u;P;] = im by; [u,H] = iP, 
[PjPr] = 9; [P,H] = 0 
[HH] = 0 


It is interesting to note that the algebra engendered by the Galileo 
transformation admits the two invariants 


(A7.46) 
P?—2mH = 2mE and = [mJ—(uxP)]}* = m?S*? = m*s(s+1), 


where # and S can be interpreted as the intrinsic energy and intrinsic 
angular momentum of the particle under consideration. At this point 
there becomes apparent the feasibility of a nonrelativistic quantum 
mechanics of particles of mass m= 0, which are characterized by two 
invariants, P? and (ux P)?, with u=iP(0/dH). Further details of this 
intriguing aspect of invariance under Galileo transformations can be 
found in the references listed below. 
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